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Abstract 

In this paper, we present a framework for the semantics and the computation of aggregates 
in the context of logic programming. In our study, an aggregate can be an arbitrary 
interpreted second order predicate or function. We define extensions of the Kripke-Kleene, 
the well-founded and the stable semantics for aggregate programs. The semantics is based 
on the concept of a three- valued immediate consequence operator of an aggregate program. 
Such an operator approximates the standard two- valued immediate consequence operator 
of the program, and induces a unique Kripke-Kleene model, a unique well-founded model 
and a collection of stable models. We study different ways of defining such operators and 
thus obtain a framework of semantics, offering different trade-offs between precision and 
tractability. In particular, we investigate conditions on the operator that guarantee that 
the computation of the three types of semantics remains on the same level as for logic 
programs without aggregates. Other results show that, in practice, even efficient three- 
valued immediate consequence operators which are very low in the precision hierarchy, 
still provide optimal precision. 

KEYWORDS: Logic Programming, Aggregates. 



1 Introduction 

This paper is a study of the semantics of an extension of logic programming with ag- 
gregates. Aggregates are specific second order functions or predicates ranging over 
sets. Standard examples are the minimum of a subset of a partially ordered domain, 
the cardinality of a set, the sum, the product and the average of a finite set of inte- 
gers or reals, etc. Aggregates play an important role in different areas. They are used 
and studied extensively in the context of databases (confer the group-by statement). 
They were introduced in the context of logic programming as declarative variants 
of the set_of and bag-of procedures QKemp and Stuckey 1991|fMumick et al. 1 990 ). 
Recently, they were introduced in the context of two extensions of logic program- 
ming, Answer Set Programming (jSimons et al. 2002|l and Abductive Logic Pro- 
gramming IjVan Nuffelen and Denecker 2000|l . Aggregates commonly show up in 
human expert knowledge and expressions of computational problems. For instance: 

• the query for the average result of students for some exam; 

• the property that the capacity of a room should exceed the number of students 
attending the course that takes place in that room; 
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• the cardinality constraint that a lecturer should not teach more than 6 courses; 

• the property or constraint that the sum of the capacities of available power 
generators in some electricity factory should exceed some given lower bound; 

These examples show that aggregates are important to express many forms of hu- 
man expert knowledge and computational problems in a direct and natural way. For 
this reason, aggregates likely will be part of computational logics and the languages 
of future knowledge based systems. 

We will study the semantics of sets of rules of the form 

A <— ip 

where A is an atomic formula and tp a logic expression possibly containing aggre- 
gate formulas. Such rule sets are a core part in logic programming and extensions 
such as abductive logic programming and extended logic programming, the sub- 
logic of answer set programming. Rule sets occur also as definitions of intensional 
predicates in deductive databases and as inductive definitions in the logic ID-logic, 
an extension of classical logic with generalized, non-monotone inductive definitions 
l|Denecker 200f)|l . Thus, the results of our study can be applied in the context of all 
these logics. 

In the context of logic programming, several extensions with aggregates were pro- 
posed for subclasses of the formalism that we consider here, in particular for mono- 
tone aggregate programs IjMumick et al. 1990l|Ross and Sagiv~1 997) and stratified 
aggregate programs l|DeirArmi et al. 20031 iMumick et al. 199f)jl . Our work extends 
such proposals in two ways. First, we consider more general rule sets with ar- 
bitrary recursion over aggregates. Second, we develop a framework of semantics 
including extensions of the three main semantics that have been used in logic 
programming: Kripke-Kleene semantics (i.e., three- valued completion semantics) 
| |Fitting 1985| ) , stable semantics ( Gelfond and Lifschit z 1988|l and the well-founded 
semantics l|Van Gelder et al. 19 91). 

The foundation of our work is the algebraic theory of approximating operators 
l|Denecker et al. 20001 IDenecker et al. 2004)) . Approximation theory is a fixpoint 
theory of non-monotone lattice operators. With any lattice operator O : L — > L, 
it associates a family of approximating operators A: L 2 — > L 2 on the product 
lattice L 2 . The fixpoint theory associates with every approximating operator A 
different types of fixpoints: a Kripke-Kleene fixpoint and a well-founded fixpoint, 
both in the bilattice L 2 and a set of A-stable fixpoints of O in the lattice L. 
In <|Denecker et al. 2000|l it was shown that the three- valued Fitting operator $p 
pitting 1985| ) is an approximation of the immediate consequence operator Tp of a 
logic program P and that the different types of fixpoints of <&p corresponds to the 
Kripke-Kleene, the well-founded and the stable models of P. 

In l|Denecker et al. 200411 . the class of approximations of O was further inves- 
tigated. The collection of approximations of a lattice operator O is ordered by a 
precision order. More precise approximations have a more precise Kripke-Kleene 
and well-founded fixpoint, and have more stable fixpoints. It was shown that O has 
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a most precise approximation, called the ultimate approximation of O which has 
the most precise semantics. 

In the context of logic programming, approximation theory induces a family of 
Kripke-Kleene, a family of well-founded and a family of stable semantics, generated 
by the class of approximations of the immediate consequence operator Tp. Basi- 
cally, each family formalizes similar intuitions but in different degrees of precision. 
In (Dcnec ker et al. 2004|l . the ultimate and the standard versions of these semantics 
are investigated. It follows from the general theory that the ultimate versions of the 
semantics are more precise than the standard semantics. Also, ultimate semantics 
have elegant semantic properties which do not always hold for the standard seman- 
tics based on the Fitting operator. For instance, substituting a rule body B by a 
formula B' which is equivalent with B in classical logic, is equivalence preserving 1 in 
the ultimate semantics but not in the standard semantics. Also, the ultimate well- 
founded model of a program with a monotone Tp is the least fixpoint of Tp. On 
the negative side, applying the ultimate approximation is computationally harder, 
and it was shown that computing the three types of ultimate semantics for propo- 
sitional programs is one level higher in the polynomial hierarchy than the standard 
versions of the same semantics. It was also shown that for important classes of logic 
programs, standard and ultimate semantics coincide. In fact, it seems that both se- 
mantics only differ for programs containing a recursively defined predicate p whose 
definition also uses reasoning by cases of the form (p A . . . ) V (->p A . . . ). In practice, 
such programs seem to be rare (we are unaware of any practical program with this 
feature). Thus, the standard semantics based on the Fitting operator and the ulti- 
mate semantics based on the ultimate approximation are two very close points in 
the hierarchy of semantics induced by approximation theory and represent different 
trade-offs between precision and complexity. 

In this work we apply the approximation theory in the context of rule sets with 
aggregate expressions. We extend the two- valued immediate consequence operator 
Tp for aggregate programs, define several different approximating operators of it 
and study the semantics obtained from them. One operator is the ultimate approx- 
imation of Tp. The three types of ultimate semantics obtained from this operator 
extend the corresponding ultimate semantics for logic programs. They also have the 
same attractive semantical properties and the high computational complexity. So, 
we also study extensions of the standard Kripke-Kleene, well-founded and stable se- 
mantics of logic programs. To achieve this, we propose the concept of a three- valued 
aggregate relation approximating a given aggregate relation. We use this concept 
to define an extension of the Fitting operator $p to the case of programs with 
aggregates. Since an aggregate relation is approximated by a class of three- valued 
aggregate relations, we obtain a sub- family of approximations of Tp, all of which 
coincide with the Fitting operator <f>p in case P does not contain aggregate expres- 
sions. Just as in the case of logic programming without aggregates, the different 
semantics based on the different approximation operators represent close points in 
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the hierarchy of semantics induced by approximation theory and provide different 
trade-offs between precision and complexity. 

2 Fixpoint Theory of Monotone and Non-monotone Operators 

We now present the necessary background on Approximation Theory. For more 
information we refer to IjDenecker et al. 2004|l . 

A structure (L, <) is a poset if < is a reflexive, asymmetric and transitive binary 
relation on L. A poset (L, <) is a chain if < is a total order, i.e., for each x, y G L, 
either x < y or y < x. Sometimes, when the order relation < is clear from the 
context we denote a poset simply with its domain L. 

A poset (L, <) is chain- complete if each chain S C L has a least upper bound 
lub(S') in L. Since the empty set is a chain, a chain-complete poset has a least 
element ±. 

A poset (L,<) is a complete lattice if each subset S of L has a least upper bound 
lub(S') and a greatest lower bound glb(5) in L. In particular, L has a least element 
_L and a largest element T. A complete lattice is chain-complete. 

An operator O : L — > L on a poset {L,<) is <-monotone if for each x,y G L, 
x < y implies 0(x) < 0(y). A monotone operator O on a chain-complete poset 
(L, <} has a least fixpoint lfp(O). This fixpoint can be constructively computed as 
a sequence of powers of O defined as follows: 

O T° (x) = x 

o r +i (x) = 0(0 r (»)) 

O | A (x) = lub({0 T a (x)\a< A}) for a limit ordinal A 
Proposition 2.1 

If O : L — > L is a monotone operator and L is a chain-complete poset then there 
exists an ordinal a such that O \ a (_L) = lfp(O). The least such ordinal is called 
the closure ordinal of O and is denoted by oo. 

Approximation theory is an extension of the above fixpoint theory to the case 
of arbitrary (non- monotone) lattice operators O. The following basic concepts are 
needed. 

Given a complete lattice (£,<), its bilattice is the structure {L 2 ,<,< p ) where 
for all x, y, x' , y' G L, 

(x, y) < (x' , y') if and only if x < x' and y < y' 
(x, y) < p (x' , y') if and only if x < x' and y' < y 

The order < on L 2 is called the product order, while < p is called the precision order. 
Both orders are complete lattice orders on L 2 . We are interested only in a subset 
of L 2 . A pair (x,y) is consistent if x < y and exact if a; = y. The set of consistent 
pairs is denoted by L c . 

A basic intuition in approximation theory is that a consistent pair (x, y) approx- 
imates an element by a lower and an upper bound. Hence, (x, y) approximates any 
element in the interval [x, y) = {z G L | x < z < y}. More precise pairs approximate 
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fewer elements: (x,y) < p (xi,j/i) implies [x,y] 3 [xi,yi]. Pairs (x,x) are called ex- 
act because they approximate a single element x. The set of exact pairs represents 
the embedding of L in L c . The product order is a complete lattice order on L c while 
< p is chain-complete on L c . Hence, any <-monotone or < p -monotone operator on 
L c has a least fixpoint in the < or < p order. Notice also that (_L, T) is the < p -least 
element of L c while the < p -maximal elements of L c are precisely the set of exact 
elements. 

Example 2.1 

Consider the lattice TWO = {f,t} of classical truth values ordered as f < t. We 
denote the set TWO c of consistent approximations of TWO with TH1Z££. 

The set TH1Z££ corresponds to the set of truth values {f , u, t} used in three- 
valued logic. The exact pairs (f,f) and (t,t), called false and true correspond to 
the values f and t, while (f,t) corresponds to u, called undefined. The product 
order on THREE, corresponds to the truth order f < u < t. The precision order 
on THTZ££ corresponds to the order u < p f , u < p t, and is sometimes called the 
knowledge order. 

We can define logical connectives in TH7Z££ in the following way. Conjunction 
A and disjunction V of two elements are defined as the greatest lower bound and 
the least upper bound with respect to the product order <. The negation operator 
is defined as ~>{x,y) — {~~<y, ~<x). In particular, -if = t, -it = f, and -iu = u. The 
truth tables of the connectives A, V, and -> are the same as in Kleene's strong 
three-valued logic. □ 

Definition 2.1 [Approximating Operator) 

Let O: L — > L be an operator on a complete lattice (L, <). We say that A: L c — > L° 
is an approximating operator of O if the following conditions are satisfied: 

• A extends O, i.e., A(x,x) = (0(x),0(x)) for every x £ L; 

• A is < p -monotone. 

We denote the projections of an approximating operator A: L c — > L c on the first 
and second components with A 1 and A 2 , i.e., if A(x,y) — (u,v) then A 1 (x,y) = u 
and A 2 (x, y) = v. 

There is a simple and natural intuition behind the concept of an approximating 
operator A. Any tuple (x, y) G L c can be viewed as a non-empty interval [x, y] — 
{z | x < z < y}. It is easy to see that for any z € [x,y], A(x,y) < p A(z,z) — 
(0(z),0(z)) which means that the set 0([x,y]) = {O(z) \ z G [x, y]} is a subset of 
the interval A(x,y). Hence, A 1 (x,y) is a lower estimate and A 2 (x,y) is an upper 
estimate to O(z), for each z in [x,y]. 

The < p -monotonicity of A guarantees that A has a least fixpoint called the 
Kripke-Kleene fixpoint of A and denoted by KK{A). This fixpoint approximates 
all fixpoints of O, i.e., if x = 0(x) then KK{A) < p (x,x). 

Next we define the stable and well-founded fixpoints of A. With a fixed element 
b e L, we can associate an operator A 1 ^, b) mapping any x G [_L, b] to A 1 (x, b). The 
operator b) is monotone but as a function from [_L, b] to L, in general it is not 
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internal in [_L, b}. Similarly, for a fixed element a G L, the operator A 2 (a, •) : [a, T] — > 
L is monotone, but in general is not internal in [a, T]. 

Definition 2.2 

Let (L, <) be a complete lattice. The upper stafr/e operator St A : L — > L is defined 
as 

S^(a) = , 9 /fe({x £ [a, T] : A 2 (a, x) < x}) 
and the lower stable operator St A : L — > L is defined as: 

St^fc) - e [_L, b] : A 1 (x, b) < x}). 

The upper stable operator maps a lattice element a to the greatest lower bound 
of the set of pre-fixpoints of A 2 (a, •). If A 2 (a, •) is internal in [a, T] then due to its 
monotonicity, St A (a) is its least fixpoint. The lower stable operator maps a lattice 
element b to the greatest lower bound of the set of pre-fixpoints of A x (-, b). This set 
may be empty, in which case St A (b) — T. However, if ^4 1 (-,6) is internal in [-L, 6], 
then St A (b) is its least fixpoint. 

Definition 2.3 

The stable revision operator StA'- L 2 — ► L 2 where (L,<) is a complete lattice is 
defined as follows: 

St A (a,b) = (St A (b),St\(a)). 

In general, the stable revision operator is not internal in the set L c . However, 
there is a subclass of L c on which this operator has very nice properties. It is defined 
as the intersection of the following subclasses: 

• A pair (a, b) E L c is A-reliable, if (a, 6) < p A(a,b). 

• A pair (a, b) E L c is A-prudent, if a < St A (b). 

It is easy to see that if (a, b) is A-reliable, then the operators b) and A 2 (a, •) are 
internal in their domain. On the other hand, if (a, 6) is A-prudent, we can guarantee 
that a is a safe underestimate of all fixpoints below b of the operator O. 

Intuitively, the stable revision operator implements two quite different approx- 
imation processes, one to refine the upper estimate b and one to refine the lower 
estimate a. Given a current upper estimate 6, we compute a new lower estimate 
by an iterative process x = -L,x\ = A 1 (x , b), . . . , x i+1 = A 1 (x i ,b), . . . until a 
fixpoint is reached. In each stage, we use A to approximate 0([xi,b]) from below, 
i.e., by setting x i+ i := A 1 (x i ,b) < O(z), for each z E [xi,b]. It is easy to see that 
each computed Xi is a lower estimate to each fixpoint of O below b, and the limit 
St A (b) is the best lower bound we can obtain through A to the set of these fix- 
points. On the other hand, the refined upper estimate is computed as a limit of 
the sequence yo = a, y\ = A 2 (a, yo), ■ • ■ , yi+i = A 2 (a, yi), .... The goal is to elim- 
inate non-minimal, non-reachable fixpoints of O above the current lower estimate 
a. Assuming that a < 0(a)(= A 2 (a,a) — yi), all points in [a, 0(a)] are considered 
reachable. On the next level, also points in 0([a, 0(a)]) above a are of interest, and 
we can approximate these points from above by computing A 2 (a,y 1 ) = y 2 . This 
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process is continued until the fixpoint St A (a) is reached and this fixpoint is taken 
as the new upper bound. 

In IjDenecker et al. 2004(1 . it was shown that the set L rp of pairs that are A-reliable 
and A-prudent contains (J_, T), is chain-complete, and the stable revision operator 
is an internal, < p -monotone operator in L rp . It follows that this operator has a least 
fixpoint, called the well-founded fixpoint of A and denoted by WF(A) . All consistent 
fixpoints of St a are A-reliable and A- prudent. They are called stable fixpoints of 
A and they are <-minimal fixpoints of A. The subset of exact stable fixpoints is 
denoted by ST (A). Exact stable fixpoints can be characterized alternatively as 
follows: x G L is an exact stable fixpoint if and only if 0(x) = x and lfp(A 1 (-, x)) — 
x. We have the following lemma. 

Lemma 2.1 

A stable fixpoint of A is a minimal pre-fixpoint of O. 

The inverse however is not true: not every minimal pre-fixpoint of O is a stable 
fixpoint of A. 

Proof 

Let x be a stable fixpoint of A and y a pre-fixpoint of O such that y < x. By anti- 
monotonicity of A 1 in the second argument, it holds that A 1 (y,x) < A 1 {y,y) = 
0{y) < y. Hence, y is a pre-fixpoint of and since x is the least fixpoint of 

this monotone operator, this implies that x < y. □ 

In general, a lattice operator O : L — > L may have many approximating operators. 
For any pair A, B of approximations of O, we define A < p B if and only if for each 
(x, y) £ L c , A(x, y) < p B{x, y). The following result about the relationship between 
the different classes of fixpoints of A and B was proven in l|Denecker et al. 2 004 ) . 

Theorem 2.1 

If A < p B, then KK(A) < p KK(B), WF(A) < p WF(B) and ST (A) C ST(B). 

So, more precise approximating operators lead to more precise Kripke-Kleene 
and Well-founded fixpoints, and to more exact stable fixpoints. It turns out that O 
has a most precise approximation Uo called the ultimate approximation of O. It is 
defined as: 

U (x,y) = (glb(0([a;, 1 ,])),lub(0([a ;) tf]))) 

where 0([x, y]) = {O(z) | z 6 The Kripke-Kleene, stable and well-founded 

fixpoints of Uo are called the ultimate Kripke-Kleene, ultimate stable and ultimate 
well-founded fixpoints of O. 

Theorem 2.2 \Denecker et al. 2004\ 

The ultimate Kripke-Kleene and ultimate well-founded fixpoint of O is the most 
precise of all Kripke-Kleene and well-founded fixpoints of all approximations of O. 
The set of ultimate exact stable fixpoints includes all exact stable fixpoints of all 
approximations A of O. 

A special case arises when O is monotone. 
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Theorem 2.3 \Denecker et al. 2004\ 

If O is monotone, then for every (x,y) G L c , Uo(x,y) = (0(x),0(y)) and its 
ultimate well-founded fixpoint is exact and is the least fixpoint of O. 

In IjDenecker et al. 2000)1 . it was shown that the Kripke-Kleene, the well-founded 
and the stable semantics of a logic program P correspond to Kripkc-Klcene, well- 
founded and exact stable fixpoints of the three- valued immediate consequence oper- 
ator <&p of P defined by Fitting (1985 1. In IjDenecker et al. 2003|l . analogous results 
were obtained in the context of default and autocpistemic logic. This shows that 
approximation theory formalizes an important non-monotonic principle. 



3 Aggregates 

3.1 Aggregate Functions and Relations 

In this text, an aggregate is understood as a second-order n-ary function or relation 
having at least one set argument. For simplicity, we assume that n = 1 in case of 
aggregate functions and n = 2 in case of aggregate relations. We denote the power- 
set of a set D with p(D). 

Definition 3.1 {Aggregate Functions and Relations) 

Let D\ and D2 be domains. An aggregate function is any function F: p(D\) —> 
An aggregate relation is any relation R C p(D\) x Z?2. 

We use F to denote an aggregate function and R to denote an aggregate relation. 
Although many aggregates are functions, for uniformity and convenience sake, our 
theory below is developed for aggregate relations. If an aggregate function F is used 
in a context which requires an aggregate relation, F is understood as its graph G F 
which is the aggregate relation defined as G F = {(S, d) | f(5) = d}. 

We now define a number of standard aggregate functions and relations which we 
study in this paper. We start with aggregate relations in the context of a poset 
(A<>: 

Definition 3.2 

• GLB C p(D) x D - defined as {(S,d) | S G p(D) and d = glb(S)}. 

• lub C p(D) x D - defined as {(S, d) \ S £ p(D) and d = lub(S)}. 

• lb C p(D) x D - defined as {(S, d) | Vi G S,d < x}. 

• UBC p(D) x D - defined as {(S, d) \ Va; G S, x < d}. 

• MIN C p(D) x D - defined as {(S, d) | d is a minimal element of S}. 

• MAX C p(D) x D - defined as {(S, d) \ d is a maximal element of S}. 

The aggregate relations GLB and lub are (graphs of) partial aggregate functions. 
If (D,<) is a complete lattice then GLB and lub are (graphs of) total aggregate 
functions. If (D, <) is a totally ordered set then min and max represent partial 
functions. If in addition D is finite then min and max represent total functions and 
min = GLB and MAX = lub. 

Next, we define aggregate functions on finite sets of numbers. Below, we assume 
that D is an arbitrary domain and D 1 is a Cartesian product D% x • • • x D n in which 
D\ is the set of real numbers R. Also, we denote the set of all finite subsets of a 
domain D by pp{D). 
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Definition 3.3 

• card: Pf{D) — ► N defined as CARd(S') = the cardinality of S; 

• SUM: Pf(D') —> R denned as sum(S) = ^ xy, 

(xi,...,x„)es 

• prod: Pf(D') — > K denned as prod(S') = ]~] xy 

(xi,...,Xn)€S 

• AVG C pp(D') x R (Average) - the graph of a partial aggregate function 
defined only for non-empty sets as (S,d) E AVG if d = SVM(S)/card(S). 

In the definition of SUM, prod and AVG, only the first clement of a tuple is used 
to compute the value. The reason to introduce the other arguments is to be able to 
count one number multiple times. That is, a set S C R x D2 x • • • x D n represents 
a multiset of real numbers. For example, when counting the total capacity of a 
building consisting of different rooms, we need to count the capacity of a room as 
many times as there are rooms with that capacity. 

All these aggregate functions have no natural extensions to infinite sets. How- 
ever, their graphs G F can be considered as aggregate relations on arbitrary sets — 
containing only tuples (S, d) for which S is finite. 

In this paper, we will focus only on aggregates with one set argument but our 
theory can be extended easily to the more general case. An example of an aggregate 
relation with two set arguments is the generalized quantifier MOST C p(D) x p(D) 
where most(A, B) expresses that most A's are B's. The relation MOST is defined 
as the set of all pairs of sets (A, B) such that CARd(A n B) > CARd(A \ B). 

3.2 Derived Aggregate Relations 

In this section we show how to obtain new aggregate relations by composition of 
existing aggregates with other relations. 

Definition 3.4 

The composition of an aggregate relation R C p{D\) x D2 with a binary relation 
P Q D2 x D3 is the aggregate relation Rp C p{D\) x D 3 defined as: 

R P = {(S,d) I 3d' e D 2 : (S,d') e r and (d',d) £ P). 

The composition of an aggregate function F: p(Z?i) — > D2 with a binary relation 
P Q P>2 x D 3 is the aggregate relation Fp C p{D{) x D 3 defined as: 

F P = {(5,d) I (F(S),d)eP}. 

Typically, the binary relation P is some partial order relation on the domain D2. 
For example, the CARD> aggregate relation is obtained as the composition of the 
CARD aggregate function with the > relation on N and contains all pairs (S, n) such 
that card(S') > n. 

An aggregate relation can also be composed with a relation on sets. We consider 
only one instance of this sort of composition. 
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Definition 3.5 

The subset aggregate of an aggregate relation R C p(D{) x Z? 2 is the aggregate 
Rc C p(-Di) x _D 2 defined as: 

Rc = {(5, d) | 35' : (£', d) e R A S' C 5}. 

For an aggregate function F, Fc denotes the subset aggregate of the graph of F. 
As an example, CARDc(<S l , d) holds if for some subset S' C 5, CARd(5') = d. In this 
case, the two derived aggregates CARDc and CARD> are identical. This is because 
there exists a subset S'CS such that CARd(S") = d if and only if CARd(S) > d. 

3.3 Monotone and Anti-monotone Aggregates 

We define two different notions of monotonicity of aggregates, one for functions and 
one for relations, and then show how they are related. 

Definition 3.6 

Let (£>2, <) be a poset and F: p{D\) — > D2 an aggregate function. We say that F 
is: 

• monotone if S\ C S2 implies f(5i) < f(S I 2); 

• anti-monotone if Si C 5 2 implies p(Sj.) > f(S I 2). 

The next two propositions list standard aggregate functions which are monotone 
or anti-monotone with respect to some partial order. 

Proposition 3.1 

Let (D, <) be a complete lattice. The aggregate function GLB: p(D) — > D is anti- 
monotone with respect to < and the aggregate function lub : p(D) — > D is mono- 
tone with respect to <. 

Standard result from lattice theory ( TJavey and Pristley 1990] Lemma 2.22). □ 

In Table we use the following notation for subsets of real numbers: R + for the 
set of non-negative numbers, R~ for the set of non-positive numbers, Rl 1,00 - 1 for 
the set of numbers in the interval [1, 00), and IRi 0,1 ) for the set of numbers in the 
interval [0, 1). 

Proposition 3.2 

Let D be a Cartesian product Dx x • • • x D n where n > 1. The aggregate functions 
in Table H from pf(D), the set of finite subsets of D, to the poset (D',<) arc 
monotone when D\ is as given in the table. 

Proof 

Follows immediately from well-known properties of real numbers. □ 

Monotonicity and anti-monotonicity of an aggregate relation are defined in the 
following way. 
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Table 1. Monotone aggregate functions on finite sets 
Definition 3. 7 

Let R C p(D 1 ) x D 2 be an aggregate relation. We say that R is: 

• monotone if (Si, d) £ R and Si C S 2 implies (S2, c?) £ R; 

• anti-monotone if (S2, (i) 6 R and Si C S2 implies (Si, d) G R. 

The next proposition summarizes the (anti-)monotonicity properties of the ag- 
gregate relations defined in Definition 13. 21 

Proposition 3.3 

The aggregate relations lb, ub C p(D) x D on a poset {D, <} are anti-monotone. 

All other relations defined in Definition 13.21 are neither a monotone nor an anti- 
monotone. 

We point out that the graph of a monotone aggregate function is not a monotone 
aggregate relation according to Definition 13.71 For example, lub is a monotone 
aggregate function but its graph is not a monotone aggregate relation. Instead, the 
composition of an aggregate function F with the inverse of the order with respect 
to which it is monotone results in a monotone aggregate relation. 

Proposition 3.4 

Let F: p(Di) — > D 2 be an aggregate function which is monotone with respect to a 
partial order relation < on D 2 . Then F> and F> are monotone aggregate relations 
and F< and F< are anti-monotone aggregate relations. 

Proof 

F>: Consider two sets Si and S 2 such that Si C S 2 C D\ and an element d G D 2 . 
Suppose that F>(Si,d) holds. By definition of F> this is equivalent to f(Si) > d. 
Since f is monotone we also have that f(Si) < f(S 2 ). So, we can conclude f(S2) > d 
which is equivalent to F>(S2, d). 

The monotonicity of F> and the anti-monotonicity of F< and F< can be proven 
in a similar fashion. □ 

As an application of this result we have that the aggregate relations CARD> and 
CARD> are monotone since the aggregate function card is monotone with respect 
to <. 

Forming the subset aggregate of any aggregate relation always results in a mono- 
tone aggregate relation. 



12 



Nikolay Pelov, Marc Denecker, Maurice Bruynooghe 



Proposition 3.5 

Let R C p(.Di) x Z? 2 be an arbitrary aggregate relation. 

1. Rc is a monotone aggregate relation. 

2. If R is a monotone aggregate relation then Rc = R. 

4 First-order Logic with Aggregates 

We introduce aggregates in the context of many-sorted first-order logic. 

A sort symbol (or simply sort) s denotes some sub-domain of the domain of 
discourse. A product type s± x • • • x s n represents the product of the domains rep- 
resented by the sorts s\, . . . , s n and a set type {s} represents the set of all sets of 
elements of sort s. 

Definition J^.l (Aggregate Signature) 

An aggregate signature E is a tuple (S: F; P; A) where 

• S is a set of sorts; 

• F is a set of sorted function symbols f : Si x • • • x s n — ► w where n > 0; 

• P is a set of sorted predicate symbols p: s\ x • • • x s n where n > 0; 

• A is a set of sorted aggregate symbols R: {si x • • • x s n } x w where n > 1. 

We use Sort(T,), Func(YA, Pred(S), and Aggr(E) to denote the sets 5", F, P, 
and A of S. We call a function symbol of the form / : — > w a constant. An aggregate 
symbol R: {si x • • • x s„} x w denotes an aggregate relation between sets of type 
Si x • • • x s n and objects of sort w. Of course, Aggr(H) may contain many instances 
of the same type of aggregate relation but with different sorts. 

For each sort s, we assume an infinite set V s of variables of sort s disjoint from 
the constants in Func(E). We denote variables, predicate symbols and function 
symbols with small letters and constants with capital letters. 

Definition 1^.2 (Terms and atoms) 

Let S be an aggregate signature. For every sort s E S, we define the set of terms 
of type s by induction: 

• a variable x £ V s of sort s is a term of type s; 

• if / : si x ■ ■ ■ x s n — > w E Func(T,) and t\, . . . , t n are terms of type s\, . . . ,s n 
respectively, then f(t\, . . . , t n ) is a term of type w. 

An atom has the form p(ti, . . . , t n ) where p: s\ x • • • x s n G Pred(Yi) is a predicate 
symbol and t\, . . . ,t n are terms of types s\, . . . , s n respectively. 

For a fixed aggregate signature S we define the notions of set expressions, aggre- 
gate atoms and formulas of the logic by simultaneous induction. 
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Definition 4-3 

A set expression of type {s\ x • • • x s n } has the form {(x\, . . . , x n ) \ (p} where ip is an 
aggregate formula called the condition of the expression and for each i = 1, . . . , n, 
Xi IS cL variable of sort s, . 

An aggregate atom has the form r(s, t) where R: {s\ x • • • x s n } x w G j4ggr(S) 
is an aggregate symbol, s is a set expression of type {s\ x • • • x s n }, and t is a term 
of type w. 

An aggregate formula is an atom, an aggregate atom, or an expression of the form 
ip A <P V -0, Vccy and 3x</? where 93 and ^ are aggregate formulas and x a 
variable. We also use ^ C ^ as an abbreviation for V -*tp. 

The set of aggregate formulas over S is denoted by £^ 9 '' ' . 

We illustrate the syntax of aggregate formulas with an example of modeling power 
plant maintenance. 

Example 4-1 (Power Plant Maintenance) 

A power plant has a number of power generators called units which have to be 
scheduled for maintenance. There is a restriction on the total capacity of the units 
in maintenance. Consider the following aggregate signature: 

£ = ({u,w 7 nat}; {Max: nat}; {capacity: u x nat,maint: ux w x w}; 

{sum: {nat x u} x nat}). 

The sort u is interpreted with units and the sort w with weeks. The predicate 
capacity(u, c) represents that a unit u has a capacity c. The predicate maint(u, s, e) 
specifies that unit u is in maintenance during the period starting at time point s 
(inclusive) and ending at time point e (exclusive) . The following aggregate formula 
expresses that the total capacity of the units in maintenance during a week, should 
not exceed a value Max: 

Vw SUM<({(c, u) I 3s3e(maint(u 7 s,e) A s < w < e A capacity(u, c))}, Max). 

In this formula the sum aggregate computes the sum of all capacities c of units u 
that are in maintenance during week w. Note that each capacity c is counted as 
many times as there are units u with capacity c. □ 

A positive literal is an atom p(ti, . . . , t n ) and a negative literal is the negation of 
an atom ->p(ti, . . . ,t n ). 

An occurrence of a variable x in an aggregate formula ip is bounded if it occurs in 
a subformula 3xip or \/xtp of ip or in a set expression {(xi, . . . , x, . . . , x n ) \ ip} in ip. 
An occurrence of x in ip is free if it is not bounded. The set of free variables of ip, 
denoted by Free(ip), is the set of all variables with at least one free occurrence in 
ip. Terms and formulas without variables are called ground and those without free 
variables are called closed. 

Now, we define the semantics of the logic. Let S be an aggregate signature. 

Definition 4-4 (Structure) 

A E- structure V consists of the following: 
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• for each sort s <E S a domain s v ; 

• for each function symbol / : s\ x • • • x s n — > w G Ftmc(£) a function 

f: S fx-x S ^/; 

• for each predicate symbol p: Si x • • • x s n G Pred(Y,) a relation 

P°Qsf x-xsf. 

• for each aggregate symbol R : {s\ x • • • x s n } x w 6 Aggr(E) an aggregate 
relation R v C p(sf x • • • x s^) x w 13 . 

Consider a S-formula <p(xi, . . . , x„) with free variables x\, . . . , x n of sorts si, . . . , s„, 
respectively and let di, . . . , d n be elements of , . . . , s^, respectively. Then, <p(di, . . . , d n ) 
denotes the formula obtained by substituting di for each free occurrence of a;, in 
ip. So, we consider domain elements as new constants of the respective sorts. We 
denote this enlarged signature with S(2?) and the corresponding set of formulas 

With C%fcy 

Definition 4-5 

The value \t\x> of a ground term t for a S-structure V is defined inductively as 
follows: 

• if t is a domain clement d, then \t\-D = d; 

• if t is a constant c, then \t\x> = c v ; 



if Us a term f(t u . . .,*„), then [t]z, = /"([ti]©, . . . , \t n \v). 



In the following definition and in the rest of the paper we often treat a relation 
R C D as a function i?: D — > {f, t} defined as R(d) = t if and only if d £ R for an 
element d E D. 

Definition 4-6 (Truth function) 

Given is a S-structure T>. We define the value \{x \ <p}Jv of a set expression {x | cp} 
and the truth value Hv{ip) of an aggregate formula ip by simultaneous induction. 
The value [{(zi, . . . ,x n ) \ ip(xi, . . . ,x n )}\-D of a set expression is the set 

{(di,...,d„) € sf x ••• x | =t} 

The tratft function Hv( ) ■ — * f° r closed aggregate formulas is de- 

fined in the following way: 

Hv(p(tl, *„)) - P^dtllc, • • • , \tn\v) 

Hv(R({x I p},t)) - R v (l{x I ^}]x,, [*]„) 

Ht>{-*p) = -iHv(p) 

Hv(<pVip) =Ht>(<p) VHt>W 

n v (p/\ip) =H V i<p) A Hv(ip) 

Ht>(3xip(x)) = \/ des T> Hx>(<p{d)) (where s the sort of x) 

Hv(^xip(x)) = f\ d( z s -D Wx>(V'(d)) (where s the sort of x) 

We define V [= ip if Ht>(<p) = t. When I? |= 93 we call 2? a model of The relation 
|= is called the truth relation or the satisfiability relation. When the structure V 
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is clear from the context, we drop the subscript V from the valuation function [•] 
and truth function H. 

We illustrate the use of first-order logic with aggregates to formalize the well- 
known magic square problem. 

Example (Magic Square) 

Given is a n x n grid which has to be filled with the integer numbers from 1 to n 2 
such that the sum of the numbers in all rows, columns, and two diagonals is equal 
to the same number M(n), known as the magic constant: 

„w \ n(n 2 + 1) 
M(n) = 1 

Consider the following aggregate signature: 

S = {{pos, nat}; {+ : not x not — > nat, * : not x not — ► nat, / : not x not — ► nat, 
Dim: nat,Mc: nat, f : pos x pos — > nat}; 0; {SUM: {nat} x nat}) 

The sort pos represents the positions of the table and the sort nat the values of the 
table. The function symbol / specifics the number in the corresponding row and 
column, the constant Dim gives the dimension of the grid, and the constant Mc 
gives the magic number. The problem is modeled by the following theory T: 

Mc = Dim * (Dim * Dim + l)/2 
Va;Vj/(l < f(x, y) < Dim * Dim) 

Vx{ix2iyiiy2(f(xi,y{) = f(x 2 ,y 2 ) -> x\ = x 2 l\y x = y 2 ) 

My SUM({z I 3x(z = f(x,y))},Mc) 

\fx SVM({z I 3y(z = f(x, y))}, Mc) 

SUM({z I 3x(z = f(x,x))},Mc) 

SUM({z I 3x(z = f(x, Dim + 1 - x))}, Mc) 

Consider any structure V such that DimP = n G N, pos v = {1, . . . , n} and natP = 
N. Then V is a model of T if and only if f v specifies a solution for the magic square 
problem of dimension n. □ 

5 Aggregate Programs 

In this section, we define the syntax of aggregate programs and introduce a basic 
semantical tool, the Tp operator. 

Given are an aggregate signature T, and a set of sorted predicate symbols IT. We 
call the symbols from S pre-defined or interpreted while those from II defined. With 
S(IT) we denote the aggregate signature consisting of both sets of symbols. 

From now until the end of this paper, we will assume a fixed aggregate signature 
X and a S-structure T> interpreting the pre-defined symbols. 

Remark 1 

Some of the pre-defined symbols are interpreted on standard domains like: 



1G 



Nikolay Pelov, Marc Denecker, Maurice Bruynooghe 



• sort symbols nat, int, real interpreted by the sets of natural, integer and real 
numbers respectively; 

• the standard function symbols +, *, — , ... on these sorts interpreted as the 
corresponding operations on numbers; 

• the standard predicate symbols =,<,... on these sorts interpreted as the 
corresponding relations on numbers; 

• all aggregate symbols defined in Section mi card, min, max, sum, .... 

Other interpreted symbols may be domain-specific. In the context of logic program- 
ming, the interpretation of the set Sd Q Sort(T,) of domain-specific sorts and the 
set Fd C Func(E) of domain-specific function symbols is normally given by the free 
term algebra generated by Fd- The interpretations s v of all sorts s € Sd and the 
interpretation f v of all function symbols / : si X • • • x s n — > s G Fd are defined by 
simultaneous induction as follows: 

• If h G sf,...,t n G S%, then /(*!,..., tn) 6 S V . 

• If h G sf ,...,*„ € s£, then f v (h,. ..,*„) = /(ti,.. .,t n ). 

In case there is only one domain-specific sort, the free term algebra corresponds 
to the Herbrand pre-interpretation, i.e., the Herbrand universe and the Herbrand 
interpretation of function symbols. 

The value of domain specific pre-defined predicate symbols may be defined by an 
extensional database on the domain of T>. □ 

A S(II)- aggregate rule r is of the form 

A «- ip 

where A is an atom of a defined predicate and ip is a S(IT)-aggregate formula. Note 
that ip may contain universal quantifiers. The atom A is called the head of the rule 
and the formula cp the body. We use body(r) to denote the body ip of r. A S(n)- 
aggregate program is a (possibly infinite) set of aggregate rules. A normal aggregate 
program is an aggregate program in which the bodies of all rules are conjunctions 
of literals and aggregate atoms. 

Now, we introduce the basic semantical constructs. 

The V-base 6asex>(n) of II is defined as 

basex>(n) = {p(d\, . . . ,d n ) \p: Si x • • • x s n £ II, and 

di G sf ,...,d n G s^}. 

The semantics of an aggregate program will be defined in the collection of E (re- 
structures extending V. For each subset I of 6asep(II), we define the S(II)-structure 
D(J) extending T> such that for every atom A G base-p (II): Wx>(z)(^4.) = t if and 
only if A e /. Clearly, this is a one-to-one correspondence between the subsets 
of 6asex>(n) and S(II)-extensions of V. In the rest of the paper, we exploit this 
correspondence and use subsets of base-p (n), called interpretations, to represent 
£(n)-extensions of T>. Sometimes, we also view an interpretation J as a mapping 
/: base v ^TWO. 

An interpretation / is a model of an aggregate program P if / is a model of the 
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first-order theory obtained from P by turning every rule A <— ip into an implication 
Vx(A C tp) where x are the free variables of A and tp. 

The set X = p(base-D(I\)) forms a complete lattice under the subset order C. This 
order extends to £(IT)-structures as follows: D(I) < D(J) if and only if / C J. 

We introduce the following notation. For any closed defined atom A — p(t±, . . . , t n ), 
\A\t> denotes the atom p([fi]x>, . . . , [*ra]x>) £ 6aser>(n). 

Definition 5.1 

The instantiation of a program P over a structure T> is defined as the set instz>(P) 
of all closed rules A *— ip such that: 

• there exists a rule A' <— ip' s P with free variables x% , . . . , x m of sorts 
si,...,s m , and 

• there exist domain elements d\ € sf, . . . , dm € s^, and 

• A= [A'(di,...,rf m )]x, and 

• = ^'(di, . . . ,dm). 

Note that the body of a rule in the instantiation of an aggregate program is a 
closed formula containing domain elements. 

We now define the two-valued immediate consequence operator of an aggregate 
program P. 

Definition 5.2 

The two-valued immediate consequence operator Tp 9 jff : X — > T of an aggregate 
program P is defined as: 

T^% r (I) = {A | A «- ip e inst v {P) and V(I) \= ip}. 



This operator extends the Tp operator for normal logic programs defined by van Emden and Kowalski (1976 1 

As for standard logic programs we have a correspondence between models of an 
aggregate program P and pre-fixpoints of Tp 9 ^. 

Proposition 5.1 

An interpretation / is a model of an aggregate program P if and only if J is a 
pre-fixpoint of T a p 9 %\ i.e., T a p % r {I) < I. 



Proof 

The proof is straightforward extension of the proof for standard logic programs. 
□ 



Definition 5.3 

An interpretation I is a supported model of an aggregate program P extending D 
if / is a fixpoint of Tp 9 ^. 

Although the supported model semantics is generally considered to be a weak 
semantics there are problems with aggregates for which it is the appropriate se- 
mantics. One such example is the Party Invitation problem ( |Ross and Sagiv 1997| ). 
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Example 5.1 {Party Invitation) 

A number of people are invited to a party. A person p will accept the invitation if 
and only if at least k of his (her) friends also accept the invitation. Consider the 
following aggregate signature: 

£ = ({person, nat}; {A: person, B: person}; 

{thr: person x nat, friend: person x person}; {CARD : {person} x nat}) 

and let 

II = {accept: person}. 

Here friend(x, y) means that y is a friend of x and thr(x, t) gives the lower bound t 
on the number of friends of x. The problem can be modeled by the following single 
rule: 

accept(x) <— thr(x,t) A CARD>({j/ | friend(x,y) A accept(y)},t). 
Consider an instance of the problem with two friends, say A and B. Each one 
of them accepts the invitation if and only if the other one accepts as well. This is 
represented by the S-structure T> in which person — {A, B} and in which JriendP 
and thr° are given by the following table: 

friend(A,B). thr(A,l). 
friend(B,A). thr(B,l). 

The aggregate program has two supported models in which accept is and {A, B} 
respectively. The second solution is not minimal but it is a correct solution to 
the problem. In reality, A and B may communicate with each other about their 
decisions to attend the party. □ 

Several other examples for which the supported model semantics is appropriate, 
including an elaborated version of the Party Invitation problem, can be found in 
l|Pelov 2004J) . It is worth noting that each of these examples can also be expressed 
in first order logic with aggregates using the completion of the aggregate program. 

5.1 Definite Aggregate Programs 

In the context of logic programming, definite logic programs are negation free logic 
programs. A definite program P characterizes a monotone Tp operator and its 
intended semantics is the least fixpoint of Tp. In this section, we extend the notion 
of definite program to programs with aggregates. 

We define the notions of positive, negative, and neutral aggregate formulas. This 
definition is not entirely syntactic, but also depends on the monotonicity or anti- 
monotonicity of aggregate symbols appearing in the formula. We can do that be- 
cause aggregate symbols always have a fixed interpretation given by the structure 
V. 

Definition 5-4 

An occurrence of a predicate P (resp. a formula iji) in an aggregate formula (p is 
neutral if it occurs in the condition 9 of an aggregate atom r({x \ 0},t) in ip such 
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that R v is neither monotone nor anti-monotone aggregate relation. Otherwise, the 
occurrence of P (resp. ip) is positive if the number of negations and aggregate 
atoms interpreted with an anti-monotone aggregate relation above P (resp. ip) is 
even and negative if the number of negations and aggregate atoms interpreted with 
an anti-monotone aggregate relation above P (resp. ip) is odd. 

Definition 5.5 (Positive and Negative Aggregate Formulas) 

An aggregate formula (p is positive if no defined predicate occurs negatively or 
neutrally in ip. An aggregate formula <p is negative if no defined predicate occurs 
positively or neutrally in <p. 

We note that in the above definition the polarity of pre-defined symbols does 
not matter. Moreover, if a formula does not contain defined atoms then it is both 
positive and negative. If the formula ip is an aggregate atom of the form r({x | ip}, t) 
there are three cases in which it can be positive. The first one is when R v is a 
monotone aggregate relation and <p is a positive formula. The second case is when 
R v is an anti-monotone aggregate relation and ip is a negative formula. The third 
one is when rP is arbitrary and ip does not contain defined predicates. Similarly, 
the aggregate atom r({x | <p>}, t) is negative if R v is a monotone aggregate relation 
and ip is negative, R v is an anti-monotone aggregate relation and (p is positive, or 
R v is an arbitrary aggregate relation and <p does not contain defined symbols. 

The main property of positive (resp. negative) aggregate formulas is that their 
satisfiability is monotone (resp. anti-monotone) for a given structure V. 

Proposition 5.2 

Let V be a S-structure and ip be a closed S(IT)-aggregate formula (possibly con- 
taining domain elements). For any pair I C J e base-u (IT), it holds that: 

• if ip is positive then V(I) \= ip implies V(J) \= ip; 

• if -0 is negative then T>( J) \= ip implies T>(I) \= ip. 

Proof 

The proof is by induction on the structure of ip. For positive and negative formulas 
without aggregates this property is a standard result in the theory of first-order 
logic. We consider only the case when ip is an aggregate atom r({x \ <p(x)},t) 
without free variables. Let Si = {{x \ (p(x)}}t>(i) an d Sj = f{x \ <p(x)}\x>(j). 
First, let ip be a positive aggregate atom. We distinguish three cases. 

1. R v is a monotone aggregate relation and ip(x) is a positive formula. For every well- 
sorted tuple d, ip(d) is a positive formula as well. By the induction hypothesis, 
we have that V(I) |= (p(d) implies V(J) |= ip(d). Consequently, Si C Sj. Finally, 
because R v is a monotone aggregate relation, (Si, pjx>) G R v implies (Sj, [£jx>) S 
R v . Thus, V(I) \= ip implies V(J) \= ip. 

2. R v is an anti-monotone aggregate relation and ip(x) is a negative formula. By the 
induction hypothesis, for every appropriate tuple d of domain elements assigned 
to x, we have T>(J) \= <p(d) implies T>(I) |= p(d). Consequently, Sj C Si. Fi- 
nally, because R° is an anti-monotone aggregate relation, (Si, |£jx>) <= R x> implies 
(Sj, Itjv) G R°. Thus, V(I) |= ip implies V(J) \= i\). 
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3. If ip contains no defined predicates, then V(I) \= ip if and only if D{ J) \= ip if and 
only if T> \= ip. 

The proof of anti-monotonicity of negative aggregate atoms is similar and is 
omitted. □ 

We point out that the class of positive (resp. negative) formulas are a strict subset 
of the class for which the satisfiability relation is monotone (resp. anti- monotone). 
For example the formula p V ~^p is a tautology and hence it is monotone, however 
it is neither positive nor negative. 

Definition 5.6 

A definite aggregate program is an aggregate program such that the bodies of all 
rules are positive aggregate formulas. 

The class of definite aggregate programs is an extension of the class of definite 
logic programs and has a monotone immediate consequence operator. 

Theorem 5.1 

If P is a definite aggregate program then Tp 9 ^ r is monotone. 
Proof 

Follows immediately from Proposition 15. 21 □ 
Definition 5. 1 

We define the least fixpoint model of a definite S(II)-aggregate program P extending 
T> as the least fixpoint of its immediate consequence operator Tp 9 ^. 

A well-known example that can be modeled as a definite aggregate program, 
is the Company Control problem (Ke mp and Stuckey 1991| IMumick et al. 19901 
|Ross and Sagiv 1997|IVan Celrier iMl . 

Example 5.2 (Company Control) 

Given is a set of companies which own shares in each other. The problem is to 
decide if a company x has a controlling interest in a company y. This is the case 
when x owns (directly or through intermediate companies controlled by a;) more 
than 50% of the stock of y. 

To model the problem we use the following aggregate signature: 

S = ({c, s}; 0; {owns stock: c x c x s}; {SUM: {s x c} x s}). 

The sort c represents companies and the sort s represents fractions of shares and is 
interpreted over the real interval [0..1]. The defined predicates are 

II = {controls: c x c}. 

The predicate ownsstock(x, y, s) means that a company x owns a fraction s of the 
stock of a company y and controls(x,y) means that x controls y. The problem is 
modeled by the aggregate program consisting of the following rule: 

controls(x,y) <— SUM>({(s,z) | (x = z V controls(x, z))f\ 

ownsstock(z, y, s)}, 0.5). 
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For numbers in the interval [0..1], the SUM aggregate lunction is monotone with 
respect to >. Consequently, by Proposition 13.41 SUM> is a monotone aggregate 
relation. Further, the formula in the aggregate atom is a positive formula, so the 
aggregate atom in the last rule is monotone. Since none of the bodies contain 
negation this is a definite aggregate program with a monotone Tp 9 ^ operator which 
has a least fixpoint /. □ 

We will now show that the least fixpoint of Tp 9 ^ corresponds to the solution 
to the company control problem. We start by giving a more precise definition of 
the control relation. Let sh{a, b) be a function which returns the fraction of shares 
of a company a in a company & or if a does not have shares in b. We define for 
every neN the level n control binary relation, denoted by C", by induction on n 
as follows: 

• C° = 0, i.e., no company has level control of another company; 

• C n+1 = {(a, b) | sh(a,b) + ^ a c \ eCn sh(c, b) > 0.5} for n > 0, i.e., a has a 
level n + 1 control over company b if the sum of the shares of a in b together 
with the shares of the companies which a has level n control in b is more than 
50%. 

Clearly, C n is an increasing sequence of relations. We define the controls relation 
between companies C as C = U„> C n , i.e., a company a controls a company b if, 
for some n > 0, (a, 6) e C . 

Proposition 5.3 
control s llp( - T ^ r) = C. 

Proof 

Let I n = Tp 9 % r t" (0) for n > 0. We will prove for each n > that controls 1 " = C n . 
Clearly, it follows from this that control s ltp( - Tr °^ ^ = C. 
For n = 0, controls 10 is empty and is equal to C. 

For n > 0, assume that controls 11 = C % for i = 0, . . . , n — 1. Fix two companies 
a and b and consider the value of the instance of the set expression: 

S = [[{(s, z) | (a — z V controls{a, z)) A owns stock (z, b, s)}]j n _ 1 . 

It is easy to see that if controls 1 ™- 1 = C" -1 then 

S = {(s, c) | (a, c) 6 C n ~ x and c contains s shares in b} U Si 

where 5*1 = {(s, a)} if a has s shares in b and Si = otherwise. It is straightforward 
then to see that controls 1 ™ contains (a, b) if and only if (a, b) € C™. □ 

Example 5.3 (Borel Sets) 

Let R be the set of real numbers. Borel sets are defined by the following monotone 
inductive definition: 

• any open set of real numbers is a Borel set; 

• for any countable set C of Borel sets, |~| C and (J C are Borel sets; 
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• if B is a Borel set then K — B is a Borel set. 

To model this definition as an aggregate program consider the following aggregate 
signature: 

E = ({s}; {compl: s — > s}; {open: s}; {glb£,lub" : {s} x s}). 

The E-structure T> interprets the sort s with the set p(K) of all subsets of the 
real numbers, the predicate open is interpreted with the set of open sets, and the 
function compl is interpreted as set complement: compl v (S) = M.—S. The aggregate 
relations GLB" and lub" are the restrictions of GLB and lub to countable input sets, 
i.e., for any set TZ of sets of real numbers and set S of real numbers, (TZ, S) £ GLB" 
if and only if \TZ\ < ui and S — f]TZ. The aggregate relations GLB^- and lub£ 
are obtained by forming the subset aggregates of GLB" and LUB" respectively (see 
Definition 13. 5J) . Then GLB£ (TZ, S) holds if S is the intersection of some countable 
subset of TZ. Likewise lub£ (TZ, S) holds if S is the union of some countable subset 
of TZ. 

The program defining Borel sets defines a single defined predicate borel : s and 
contains the following rules: 

borel(S) <— open(S). 
borel (compl (S)) <— borel(S). 

borel(S) «- GLb£({B | borel (B) }, S) V lub^ ({B \ borel(B)}, S). 

Each of these rules is the formal representation of one of the rules in the inductive 
definition of Borel sets. Since GLB^ and lub" are monotone aggregate relations 
(Proposition |^J) this is a definite aggregate program and it defines a monotone 
operator Tp 9 ^ r . Consequently, the set of Borel sets is the least set of sets closed 
under the rules of the inductive definition and this corresponds exactly to the least 
fixpoint of Tp 9 ^ r . Thus, borel(d) £ lfp (Tp 9 ^ r ) if and only if d is a Borel set. □ 



5.2 Stratified Aggregate programs 

The important class of stratified aggregate programs was already considered by 
several authors <|Mumick et al. 19901 IDell'Armi et al. 20031 IFaber et al. 2004jl . It 
is a natural extension of the concept of stratified logic program ( |Apt et al. 1988| ) 
where aggregates are treated as negative literals. 

Definition 5.8 

An aggregate program P is stratified if for each defined predicate p, there is a unique 
natural number > called the level of p such that if q occurs positively in the 
body B of a rule with head p, then \\q\\ < \\p\\ and if q occurs negatively in B or 
in an aggregate atom, then ||g|| < ||p||. The level ||P|| of P is the maximum of the 
levels of the defined predicates. 

For each level i, let Pi be the set of all rules with a predicate of level i in the 
head and IT the set of defined predicates of level i. Define for each i > 0, E; = 
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Assume i > 1 and fix an arbitrary £j_i -structure V extending T>. Notice that 
all predicates of IT occur only positively in bodies of Pi. Consequently Pi is a 
definite aggregate program and has a monotone Tp 9 ?£, operator. Note that it does 
not matter whether the aggregates in Pi are monotone or non-monotone, since they 
do not contain predicates of II,; . 

Definition 5.9 

The standard model of an aggregate program P extending T> is the interpretation 
/ = Ui<i<||p|| wriere the set {ij | 1 < i < \\P\\} is defined by the following (finite) 
induction: 

V Q =V; 

V l= V( |J /,-). 

i<i<i 

The aggregate program in the following example is a stratified aggregate program. 
Example 5.4 (Shortest Path) 

Consider the signature of directed weighted graphs 

5 = ({n, w}; 0; {edge: nxnx u>}; {min: {w} x w}). 

A S-structure T> interprets the sort n with a set of nodes, and the sort no, represent- 
ing weights, with some set of real numbers w v C R. The graph is defined by the 
relation edge where (a, b, w) G edge represents an edge from a to b with weight 
w. 

Consider the following formulation of the problem of finding the weight of the 
shortest path between two nodes which can be found in l|Van Gelder 19921 Example 
4.1). 

sp(x,y,w) <— min({c I cp(x, y, c)}, w). 

cp{x,y,c) <- edge(x,y,c). 

cp(x, y, ci + c 2 ) <- cp(x, z, ci) A edge(z, y, c 2 ). 

The aggregate relation MIN is neither monotone nor anti-monotone, so the aggre- 
gate atom min(. . .) in the first rule is neutral. Consequently, the program is not 
definite. However, the program is stratified. The first stratum which defines the 
cp/3 predicate is a definite logic program. The predicate cp/3 represents the tran- 
sitive closure of the graph: cp(a, b, w) is true in the least model of P cp if and only if 
there is a path between a and b with weight w. The second stratum contains only 
the definition of sp/3 and sp(a, 6, w) is true in the standard model of the program 
if and only if a shortest path between a and b exists and has weight w. □ 

6 Ultimate Semantics for Aggregate Programs 

We start our study of the semantics of general aggregate programs with a brief 
investigation of the semantics generated by the ultimate approximating opera- 
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tor Up 9 jt, r of Tp 9 ^. This semantics of aggregate programs was first studied by 
Denecker et al. (2001 1. 

Definition 6.1 

The ultimate approximating operator Up 9 ^, 1 " : J c — > X c of Tp 9 ^ r : T — > X is defined 
as: 

U a P % r (h,I 2 ) = ( f]T a P 99r (I), \jT a P %\I)). 

ie[hM ie[hM 

Definition 6.2 

The ultimate Kripke-Kleene model, the ultimate well-founded model, and the set of 
ultimate stable models of an aggregate program P are defined as the Kripke-Kleene, 
the well-founded, and the set of exact stable fixpoints of the Up 9 ^ r operator. 

Example 6.1 

Consider the following program with Herbrand universe {0, 1}: 

p(0) <— card({x\p(x)}, 1). 

Observe that this program has two supported models: and {p(0)}. Also, this is 
not a definite aggregate program and its immediate consequence operator is non- 
monotone as can be seen from: 

T a P T({p(o)}) = mo)}, 

T^(MO),p(l)}) = 0. 

Let us construct the well-founded fixpoint. We start from the pair (J_,T). The 
new upper and lower bounds are obtained by applying the stable revision operator 
of Definition 12.31 on (J-,T) = (0, {p(0),p(l)}). The new upper bound is the least 
fixpoint of (J7p 9 p r ) 2 (0, •). It is easy to see that : 

(C/^ r ) 2 (0,0) = T a P % r {®) =0. 

It follows that is a fixpoint of this operator; it is obviously the least fixpoint. 
Likewise, the new lower bound is the least fixpoint of (Up 9 ^) 1 ^, fp(0),p(l)}). This 
is as well, since: 

{U a P % r )\%,{p(Q),p{l)}) = ^ 
Consequently, the well-founded fixpoint is (0,0). This represents the two- valued 
interpretation and this is also the unique ultimate stable model of this program. 
The ultimate Kripke-Kleene model can be computed in two computation steps: 

(^T(Mp(o),p(i)}) = 0; (^T(Mp(o),p(i)}) = Mo)} 
(u a P %y(9Ap(o)}) = 0; (^) 2 (0,mo)}) = Mo)}- 

The model is the three-valued interpretation M0) u } = (0> MO)})- ^ 
We obtain the following corollary to Theorem 12 . 31 and Theorem 15 .11 
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Corollary 6.1 

If Tp 9 ^, r is monotone, then the ultimate well-founded fixpoint of P is the least 
fixpoint of Tp 9 j?, r and the unique ultimate stable fixpoint of P. If P is a definite 
aggregate program, then its ultimate well-founded model and unique ultimate stable 
model is the least fixpoint model of P. 

It follows that both the ultimate well-founded and the ultimate stable semantics 
correctly model the company control program in Example 15.21 and the Borel sets 
program in Example l5.3l Later, we will also show a similar result to Corollarv ltj. ll for 
stratified programs: if P is a stratified program then its ultimate well-founded and 
unique ultimate stable model coincide with the standard model of P. Hence, the 
ultimate semantics also models correctly the shortest path program in Example 1 5. 41 

Two aggregate programs with the same immediate consequence operator are 
equivalent under ultimate semantics. Since substituting formulas in rule bodies by 
equivalent formulas preserves the operator, this operation is equivalence preserving. 

Proposition 6.1 

Let P and P' be aggregate programs such that P' is obtained by substituting a 
formula if' for a formula tp in the body of a rule of P. If V((/J = if') is satisfied in all 
two-valued £(II)-extensions T>(I) of V, then Tp 9 ^ r — Tp 9 , 9 ^, and P and P' have the 
same ultimate Kripke-Kleene model, the same ultimate well-founded model, and 
the same set of ultimate stable models. 

Another result about the set of ultimate stable models is that they are always 
minimal models. In fact, we can prove such result for the set of stable models 
associated with any approximating operator of Tp 9 ^ r . 

Proposition 6.2 

Let P be an aggregate program and A be an approximating operator of Tp 9 Jp . Each 
stable model of A is a minimal model of P. 

Proof 

By Lemma |2. II every stable model of A is a minimal pre- fixpoint of Tp 9 ^ r and by 
Proposition the pre-fixpoints of Tp 9 ^ are exactly the models of P. □ 

The nice semantical properties of ultimate semantics come at a computational 
price. Even for programs without aggregates, computing the ultimate well-founded 
model is co-NP-hard and deciding the existence of a two-valued ultimate stable 
model is E^-complete fDen ecker et al. 2004|l . For this reason, we will study weaker 
semantics based on less precise approximations of Tp 9 ^. 

7 Extending the Standard Well-founded and Stable Semantics 

The goal of this section is to extend the Kripke-Kleene ^Fitting 1985| ), well-founded 
l|Van Gelder et al. 1991|) and stable <|Gelfond and Lifschitz 1988f semantics of nor- 
mal logic programming. According to Approximation Theory ( Denc cker et al. 2000(1 
these three semantics can be obtained from the three- valued immediate consequence 
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operator <£>p denned by Fitting (19851. In particular, the collection of three- valued 
interpretations corresponds to the set L c of consistent pairs of the lattice L of 
two- valued interpretations. The operator <I>p is an approximation on this set, it 
approximates the Tp operator and its stable and well-founded fixpoints correspond 
to the stable and well-founded models of P. By extending $p to the class of aggre- 
gate programs, we will be able to obtain well-founded and stable semantics which 
extend those of logic programs without aggregates. 

To extend the Fitting operator for aggregate programs, we must be able to eval- 
uate the aggregate formulas in three-valued interpretations. For this reason we in- 
troduce the concept of a three- valued structure. It is similar to standard structures, 
except that predicates are assigned three-valued relations and aggregate symbols 
are assigned three- valued aggregate relations. Because the value of a set expression 
in a three- valued structure can be a three- valued set, three- valued aggregates take 
three-valued sets as argument. We first illustrate these points with an example. 

Example 7.1 

We denote a three- valued set by indexing its certain elements with t and its possible 
elements with u. Let us fix the three-valued set S = {1*, 2 U , 3*, 5 U }. 

A three-valued aggregate Card of the cardinality aggregate CARD is a mapping 
from pairs of three- valued sets and natural numbers to THIZEE. In case of the set 
S, correct values for the set S are the following: 

Card(5, n) = u, for all n G {2, 3, 4} 
Card(5, n) = f , for all natural numbers n $ {2, 3, 4} 

This specifies that the set approximated by S has between two and four elements. 
Similarly, correct values in case of CARD> are as follows: 

CAKD>(S,ri) =t, for all n £ {0, 1,2} 
CARD>(S,n) = u, for all n E {3,4} 
Card(S, n) = f , for all natural numbers n $ {0, . . . , 4} 

This specifies that each set approximated by S certainly has more than zero, one 
and two elements, and has possibly more than three or four elements, but definitely 
has not more than five elements or more. A weaker but still correct value for S 
would be: 

CARD>(S,0) = t, 

CARD>(,S,n) = u, for all n e {1,2,3,4,5,6} 
Card(S', n) = f , for all natural numbers n > 6 

which specifies that the set approximated by S has certainly more than elements, 
possibly more than one to six elements and certainly not more than 7 elements or 
more. □ 

We now formalize the notions of three- valued aggregate relations and structures. 

Definition 7.1 (Three-valued Aggregate Relations) 

A three-valued aggregate relation is a function 1Z: p(Di) c x Di — > TTilZSS which 
satisfies: 
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• < p -monotonicity: for every pair of three-valued sets 5*1,52 E p(D\) c and for 
every d E D 2 , if Si < p S 2 then H(§i,d) < p K(S 2 ,d); 

• exactness: for every exact (two- valued) set 5 E p(-Di) and for every d E D%, 
K((S,S),d) E TWO. 

The concept of a three- valued aggregate relation is very similar to approximating 
operators (Definition 12.1(1 . 

Remark 2 

The definition has a straightforward extension to aggregates with multiple set argu- 
ments by requiring < p -monotonicity and exactness conditions for all set arguments. 

A three-valued aggregate relation TZ approximates an aggregate relation R if for 
each set 5 E p{D\) and for each d E D 2 , 1Z((S, S),d) — R(S,d). Due to the 
exactness condition, a three-valued aggregate relation approximates exactly one 
aggregate relation. 

Recall that TH1ZE8 = TWO c . It follows that a three- valued aggregate relation 
TZ: p(Di) c x D 2 — > TH7Z££ is completely determined by the pair (TZ 1 ,TZ 2 ) of 
its projections on the first and second component. These projections are relations 
H 1 , K 2 C p(D 1 ) c x D 2 such that TZ 1 C ^ 2 and K 1 , K 2 coincide on two-valued sets 2 . 
We will frequently define a three-valued aggregate relation 1Z by defining 1Z 1 and 
1Z 2 separately. 

Definition 7.2 (Three-valued Structure) 

Let E be an aggregate signature. A three-valued Y,-structure T> consists of the fol- 
lowing: 

• for each sort s E 5 a domain s 27 ; 

• for each function symbol / : s\ x • • • x s n — > w E Func(T,) a function 

• for each predicate symbol p: si x ••■ x s„ e Pred(S) a three-valued relation 

p 5 : sf x ••• x -> THK££. 

• for each aggregate symbol R: {si x ■ • • x s„} x m £ A(7gr(E) a three-valued 
aggregate relation 

R 5 : p((sf) c x • ■ ■ x (s*) c ) x u;* -> THKSE. 

A three- valued E-structure I? approximates a E-structure 2? if for each predicate 
symbol p 1 p° approximates p° and for each aggregate symbol R, vP approximates 

Now we define a precision order between three-valued aggregate relations and 
structures. 



2 Equivalently, 1Z 1 ,1Z 2 are functions p(Di) c X D2 — > TWO which coincide on two-valued sets 
and such that TZ 1 < TZ 2 . 



28 



Nikolay Pelov, Marc Denecker, Maurice Bruynooghe 



Definition 7.3 

For all three-valued aggregate relations 72i,72.2: p(Z?i) c xfl 2 ^ TH1Z££, define 
IZi < p IZ2 if TZi(S,d) < p lZ2(S,d) for every three-valued set S s p(Di) c and 
domain clement d £ Z?2- 

For all three- valued E-structures V\ and 7J>2, define T>\ < p T>2 if 2?i and T>2 
have the same domain, the same interpretations of sort and function symbols, for 
each predicate symbol p € -Pred(E), p Vl < p p® 2 and for each aggregate symbol 
R e Aggr(T,), K Vl < p bP 2 . 

It is straightforward to see that if 72i < p 72.2 and 72.2 approximates an aggregate 
relation R then 72i also approximates R. 

Definition 7.4 (Three-valued valuation and truth functions) 

Let E be an aggregate signature and V be a three- valued E-structure. We define the 
three-valued valuation function |-J^ for set expressions and the three-valued truth 
function TCf, for aggregate formulas by simultaneous induction. 

Let {(xi, . . . , x n ) | . . . , x n )} be a set expression of type {si x • • • x s n }. The 

value [{(xi, . . . , x n ) \ <p\\f, is the three-valued set S defined as: 

S(di, . . . ,dn) = 7tp(<^(di, . . . ,d n )) 

for every (d±, . . . , d n ) £ sf x ■ ■ • x s%. 

The three-valued truth function for first-order aggregate formulas 'Hf ) (-) '■ £^5) ~~ * 
TH1Z££ is defined as in Definition 14. 61 using the three- valued operations A, V, and 
-i as defined in Example l2.il 

Next, we show that the three- valued truth function 7Y-g, is monotone with respect 
to the precision order < p on three- valued interpretations. 

Proposition 7.1 

Let T>i and T>2 be three- valued E-structures. If T>\ < p T>2 then for every E-aggregate 
formula ip, H&^ip) < p Hf, 2 (tp). 

Proof 

The proof is by a standard induction argument on the size of ip. For aggregate 
atoms it follows from the < p -monotonicity of the three-valued aggregate relations. 
□ 

Another proposition shows the correspondence between three-valued and two- 
valued truth functions. If all predicate symbols have two-valued interpretations 
then evaluating an aggregate formula in a three-valued structure results in a two- 
valued truth value. 

Proposition 7.2 

Let 2? be a E-structure and T> be a three-valued structure which approximates 
T>. For every aggregate formula (p such that p 13 is two- valued for all predicates p 
appearing in tp, 7t^(ip) = (Hv(<p),Kv(tp))- 
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Proof 

The proof is by a standard induction argument on the size of (p. For aggregate atoms 
it follows from the exactness condition of the three- valued aggregate relations. □ 

In the sequel we will consider only three-valued structures for which only the 
interpretation of the defined predicates II and the aggregates is three- valued while 
the interpretation of the pre-defined predicates is two-valued. Such structures are 
denoted by V{I) where I: basef,(H) — > THUES gives the (three- valued) interpre- 
tation of the predicates in II. 

We now extend Definition 15 . 21 of the immediate consequence operator for aggre- 
gate programs to the three-valued case. 

Definition 7.5 

The three-valued immediate consequence operator ^ a p 9 ^ : 1 C — » X c for an aggregate 

program P maps any three- valued interpretation I to a three- valued interpretation 
I' = & p 99 £{I) such that for each ground defined atom A S base-p (II): 

I' (A) = \J{n m (ip) \A^cpe instf,(P)}. 

Or, the truth value of a defined atom A in I' is the greatest of all truth values of 
bodies of rule instances with A in the head. 

Proposition 7.3 

If 2? is a three-valued structure approximating T> then $ p ^ is an approximating 
operator of Tp 9 ^. 

Proof 

Follows from Proposition l7. II and Proposition l7.2l □ 

Definition 7.6 

Given a three-valued structure 2?, the T> -Kripke-Kleene model, the T> -well-founded 
model and the set of V-stable models of an aggregate program P are defined as 
the Kripke-Kleene, well-founded and the set of exact stable fixpoints of the <& p gg ? 
operator. 

Example 7.2 

Reconsider the program of Example 16. II with Herbrand universe {0, 1}: 

p(0) <— CARD({x\p(x)}, 1). 

We will show that, for an appropriate value of the three- valued aggregate, its stan- 
dard well-founded model is identical to its ultimate well-founded model. 

To compute stable and well-founded models, we need to choose a three-valued 
aggregate Card approximating CARD. Let us assume that 

Card({0 q , l a }, 1) = Card((0, {0, 1}), 1) = u = (f , t) 



CARD({0 a }, 1) = Card((0, {0}), 1) = u = (f , t). 
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Then the following assignments can be computed easily: 

^ a P % r y({p(0f,p(if}) -(*pT) 1 ((0,{p(O),p(i)})) =0; 

(*££) a ({p(o) a ,p(i) a }) =($pT) 2 ((0,MO),p(i)})) -MO)}; 
($ P T) 1 (M0) fi }) = (^ r ) 1 ((0,{p(o)})) = 0; 
($g") 2 (Mo) a }) = ($pj) 2 ((0, {p(o)})) = MO)}. 

These assignments are the same as for the ultimate approximation Up 9 ^ r . It follows 
that the empty set is the 2?-well-founded model and the unique Instable model 
of this program and {p(0) u } is the P-Kripke-Kleene model. □ 

For logic programs without aggregates each ^ p 9 !^ operator coincides with the 



operator <&p defined by Fitting (19851. So, the well-founded and stable semantics 
of aggregate programs is an extension of the well-founded and stable semantics of 
normal logic programs. 

Notice that the semantics of an aggregate program P depends on T> and, in par- 
ticular, on the choice of the three- valued aggregates. This means that we still have a 
family of different semantics. This family can be ordered by precision. Not surpris- 
ingly, using more precise three-valued aggregates leads to more precise semantics. 

The following proposition is a straightforward consequence of Proposition 17. II 

Proposition 7.4 

For every pair of three-valued S-structures T>\ and T>2 and for every three-valued 
interpretation /, if V x < p V 2 then (J) < p $ p ^ 2 (f). 

So, by Theorem l2.1l we obtain the following result. 

Theorem 7.1 

Let P be an aggregate program and T>\ and T>% be two three-valued E-structures 
such that T>x < p X> 2 - Then: 

• the £>i-Kripke-Kleene model of P is less precise (in the < p order) than the 
X>2-Kripke-Kleene model of P; 

• the T>i -well-founded model of P is less precise (in the < p order) than the 
X>2-well-founded model of P; 

• every Di-stable model is a XVstable model. 

The semantics that we have defined in this section do not satisfy all the strong 
declarative properties of the ultimate semantics defined in the previous section. For 
example, the 2?-well-founded model of an aggregate program with monotone im- 
mediate consequence operator is not necessarily its least fixpoint. E.g. the program 
{p<-pV ~~>P-} has a constant, hence monotone Tp with least fixpoint {p}, but in 
its well-founded model p is unknown. Also, substituting a formula for an equivalent 
formula in a rule body is not in general equivalence preserving. E.g. substituting 
true for p V —>p in the above program does not preserve equivalence. However, some 
interesting properties still hold. 
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Proposition 7.5 

Let P and P' be aggregate programs such that P' is obtained by substituting an 
aggregate formula ip' for an aggregate formula ip in the body of a rule of P. If 
\/(ip = ip') is satisfied in all three-valued E(Il)-extensions of T>, then P and P' are 
equivalent under the X>-Kripke-Kleene, 2?-well-founded and Instable semantics. 

Proof 

Follows from the fact that P and P' have the same three-valued immediate conse- 
quence operators. □ 

The three-valued equivalence condition in this proposition is strictly stronger 
than the two- valued equivalence condition in Proposition 16.11 For example true 
and p V —>p are equivalent in two- valued semantics but not in three- valued. 

Another important property is that in case of a stratified aggregate program P, 
the 2?-well-founded semantics and X>-stable semantics coincide with the standard 
semantics as defined in Section 15.21 and it does not matter how the aggregate 
relations are approximated by T>. 

Theorem 7.2 

Let P be a stratified S(II)-aggregate program. For any three- valued S-structure 
V approximating V, the P-well-founded model is two-valued and is equal to the 
standard model of P extending V and to the unique D-stable model of P. 

The proof of this result depends on the following lemma. 

Lemma 7.1 

Let ip be a closed S(II)-aggregate formula such that predicates of II do not occur 
in an aggregate atom. Let T> be a S-structure and D be a three- valued E-structure 
which is two-valued on all predicates in £ and approximates T>. Then for any 
three- valued interpretation (7i,/2), if the predicates in II occur only positively in 
ip then r H.f,^i 1 i 2 )(f) — (7~£v(i 1 )( l p),'Hv(i2)( l p)) an d if the predicates in II occur only 
negatively in ip then H^^ip) = (H v{l2 )((p),H v{Il) {<p)). 

Proof 

By simultaneous induction on the structure of (p. We give only the case when the 
predicates of II occur only positively in tp. The proof of the other case is symmetric. 

• For a pre-defined atom p(t\, . . . , t n ): 

^(jj./^M*!) •••=*«)) = Wb(p(*i> •••>**)) 

= (H v {p(ti,...,t n )),Hv{p{ti,...,t n ))). 

• For a user defined atom p(t\, . . . ,t n ): 

W6(Jx,j a )(p(*i. ■ ■ •>**)) = (h,h)(p(ltih, Mb)) 

= (hipdhjv, lt„h)),h(p(ltih, Wp))) 
= (Wu^jCp^i, . . ■ ,t n )),H-D(i a )(p(ti, ■ ■ -,t„))). 
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For a formula with negation -up: 

Kv(i 1 ,i 2 )(-' ( P) = " ,W P(/ 1 ,/ 2 )( 1 ^) b y DefinitionEl 

= ~'('HT>(i 2 )( l p),7~tv(i 1 )( i P)) by the induction hypothesis 

= h'Hv(i 1 )( l fi),^'Hv(i2)( l P)) by definition of 

= (Wx>(7i) Wp(/ 2 )(-iy)) by Definition Ol 

For a conjunction ip A ip.: 

= {Hv(i 1 )(<p),'H V (i 3 )(<p)) A (H v{Il) (ip),H v{l2) (ip)) 
= A H B(;i )()/)),Hi,( /2 )^) AHp(7 2) (i/))) 

The proofs for formulas of the form 93 V ip, 3xip, and Viry are analogous. 

Let r({x I ip}, t) be an aggregate atom. Since tp contains only pre-defined predicate 

symbols from £ then 

I ¥>}>*)) = I ¥>},*))• 

Moreover, since the interpretation by I? of all predicate symbols is two- valued we 
have (by Proposition 17. 2|) : 

HviHlx I ip}, t)) = (Hv(R({x I ip}, t)),n v (R({x I ip}, t))). 

□ 

Proof of Theorem \7.S2\ (Sketch) 

Let P be stratified by the level mapping ||.||, and let IL be the predicates of level 
i, Pi the set of rules of P with head in Ilj. 

Given an interpretation J £ I, let us define I\i as the restriction of I to the 
predicates of ILj, and I\<{ and 7|<j as the restriction of I to the predicates of 
{Jj <i Tlj, respectively those of (J J<i TIj-. We extend these notations also to three- 
valued interpretations. It is easy to see that for every i — 1,...,||P|| and every 
I, J 6 T c , if I\<i = J\<i, then 

*%g(f)\<i = *%g(j)\<i. 

Moreover, for every I e T c : 

It follows from Corollary 3.12 in IjVennekens et al. 2006|l that I is the well-founded 
fixpoint of $p 9 ? r if and only if for every i = 1, . . . , ||P||, I\i is the well-founded 
fixpoint of y 

The next step is to prove by induction that for every i = 1, . .., ||P||, the well- 
founded fixpoint I\i of $ fl is equal to (/Lih) where I is the standard model 

Pii'- > {I\<i) 

of P. This will show that the V- well-founded model I of P is equal to the standard 



Well-founded and Stable Semantics of Logic Programs with Aggregates 33 



model I of P. Fix i and assume that j|<j = (/|<j, /|<j). Let = SU U^i-^') 
2?' = P(/| < j) and 2?' = ©(/j^). Notice that T>' approximates V and 2?' is two- 
valued on all predicates of Sj_i (because 7|<i is a two- valued interpretation). Using 
Lemma 17. II we can show that for any three- valued IL-interpretation (7i, J2), 

*%%,(hM = (T^nr^,(/ 2 )). 

By Theorem 12 .31 it follows that Qp 99 ^, is also the ultimate approximation of Tp 99 £,. 

So, the 2?'-well-founded model of Pj is equal to the ultimate well-founded model of 
Pi extending V and, by Corollarv l6.il to the least fixpoint of Tp 99 £,. □ 

Since the $ a p 9 ^ operators are less precise than the ultimate approximation Up 9 ^ r 
we have the following corollary of Theorems 12.11 and 17.21 

Corollary 7.1 

For a stratified program P, the ultimate well-founded model is two-valued and is 
equal to the unique ultimate stable model and to the standard model of P. 

In the next sections we define several different three- valued aggregate relations 
and study the semantics obtained from the corresponding <& p ff ? r operator. 

7.1 Trivial Approximating Aggregates 

As a first example of a three-valued aggregate relation, we consider the least precise 
approximation of an aggregate. In the sequel, it will be convenient to view an 
aggregate relation both as a subset of p(-Di) x £> 2 and as a function from p(D\) x D 2 
to TWO. 

Definition 7. 7 ( Trivial Approximating Aggregate) 

Let R C p(Z?i) x Z?2 be an aggregate relation. The trivial approximating aggregate 
triv(K) : p{Di) c x D 2 — > THTZ8S of R is defined as follows: 



triv(R)((Si,S 2 ),d) 



(f,t) ifSi^S 2 

(R(S 1 ,d),R(S 1 ,d)) ifS x =S 2 



Proposition 7.6 

For every aggregate relation R, triv(R) is a three-valued aggregate relation and 
triv{R) approximates R. 

For a (two- valued) structure T> we define triv(V) as the three- valued structure in 
which every aggregate relation R is interpreted with triv(R). When T> is clear from 
the context we simply use triv. 

As mentioned earlier, the trivial approximating aggregate is the least precise 
three-valued aggregate relation. 

Proposition 7.7 

Let R be an aggregate relation. For every three-valued aggregate relation 1Z of R, 
triv(R) < p TZ. 

The trivial approximating aggregates are very imprecise. Nevertheless, by The- 
orem[^l they suffice to model the semantics of the important class of stratified 
aggregate programs. 
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Corollary 7.2 

Let P be a stratified aggregate program. The friu-well-founded model of P is two- 
valued and is equal to the standard model of P and the unique triv-stable model 
of P. 

This corollary shows that even the weakest instance of the well-founded and stable 
semantics suffices to define the standard model approach for stratified aggregate 
programs. 

7.2 Ultimate Approximating Aggregate 

In this section we investigate the instance of the <^ a ^ 9r operator in which aggre- 
gate symbols are interpreted with the most precise three- valued aggregate relation, 
called the ultimate approximating aggregate. This three-valued aggregate relation is 
defined for all aggregate relations in a uniform way using a construction similar to 
that of ultimate approximations. 

Definition 7.8 (Ultimate Approximating Aggregate) 

Let R C p(-Di) x D 2 be an aggregate relation. The ultimate approximating aggregate 
of R is a three- valued aggregate relation ult(R): p(Di) c x D 2 — > TTLTZES defined 
as: 

ult(R) 1 ((S u S 2 ) ) d) =t if and only if VS 6 [S 1 ,S 2 ]: (S,d) e R 
ult(R) 2 ((S u S 2 ) ) d) =t if and only if 35 e [S 1 ,S 2 ]: (S,d) e R 

Proposition 7.8 

For every aggregate relation R, ult(K) is a three- valued aggregate relation and ult(R) 
approximates R. 

For a structure T> we define ult(T>) as the three- valued structure in which every 
aggregate relation R is interpreted with ult(R). When V is clear from the context 
we simply use ult. 

The ultimate approximating aggregate ult(R) is the most precise in the < p -order 
among all possible three-valued aggregate relations. 

Proposition 7.9 

Let R be an aggregate relation. For every three-valued aggregate relation TZ which 
approximates R, TZ < p ult(R). 

So, $p 9 ^[ t is the most precise operator in the family of ^p 9 ^ operators and by 
Proposition 17.11 the w/i-well-founded and the ult-stable semantics are the most 
precise semantics of aggregate programs in this family. Of course, these semantics 
are still weaker than the ultimate well-founded and ultimate stable semantics from 
Sectional Recall that for the program {p p V p is true in the ultimate well- 
founded model but unknown in the u/i-well-founded model. An aggregate program 
showing similar behavior is given in the following example. 
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Example 7.3 

Consider the following program with Herbrand universe {0, 1, 2, 3}: 

p(x) <— CARD< ({x\p(x)}, 1) V CARD> ({x\p(x)}, 2). 

Note that the body is a tautology in 2-valued logic. Therefore, in the ultimate 
semantics, this program is equivalent with: 

p(x) <- t 

In the ultimate Kripke-Kleene, well-founded and unique stable model, p is true 
for all domain elements. On the other hand, in the ■u/t-well-founded model, each 
atom p(i) has value u and the program has no ult-stable model. This can be seen 
as follows. First, we observe that for each three-valued interpretation /, for every 
two-valued / approximated by /, (&p 99 u) 2 (I) is an upper bound of Tp 9 j£ (I) = 
{p(0), . . . ,p(3)} = T. Hence, (Qp 99 ^) 2 is a constant operator and maps each three- 
valued interpretation I to {p(0), . . . ,p(3)}. On the other hand, it is easy to see that 
for the three- valued set S = {0 a , . . . , 3"}, ult(CAKD<)^ (S , 1) and ult(CAB,B>y (S , 2) 
are both false, since S approximates sets with strictly more than one element and 
other sets with strictly less than two elements. Therefore, 

(*p 9 ^) 1 ({p(0) q , • ■ • ,p(3) q }) = (*%% t )H9, MO), • • ■ ,P(3)}) = 0. 

Hence, is the least hxpoint of {Qp 9 ^) 1 (■ , {p(0), ■ ■ ■ ,p(3)}) and {p(0), . . . ,p(3)} is 
the least fixpoint of ($p s fj' t ) 2 (0, •)■ It follows that the ult- well-founded fixpoint is 

{p(0f,...,p(3f} = (<D,{p(0),...,p(3)}). 

As for the stable semantics, any ult-st&ble fixpoint is also a supported fixpoint, 
and the only supported fixpoint is {p(0), . . . ,p(3)}. However, the least fixpoint of 
{^P%lt) 1 (; {p(0), • • • ,p(3)}) is not {p(0), . . . ,p(3)} but 0. Hence, there are no un- 
stable fixpoints. □ 

Now we look at characterizations of the ultimate approximating aggregate of 
some common aggregate functions. Such characterizations are useful for several 
purposes. First of all, they can be used in an implementation of the semantics. 
Second, they can be used for complexity analysis of the semantics. 

For monotone and anti-monotone aggregates the truth value can be computed 
directly on the boundary multisets. 

Proposition 7.10 

Let R: p(-Di) x D 2 be a monotone aggregate relation. Then ((Si, S2),d) £ ult(R) 1 
if and only if (Si, d) £ R and ((Si, S 2 ), d) E ult(R) 2 if and only if (S 2 , d) G R. 

Proposition 7.11 

Let R: p(-Di) x D 2 be an anti-monotone aggregate relation. Then ((Si,S2),d) £ 
ult(B.y if and only if (S 2 , d) G R and ((Si, S 2 ),d) G ult(R) 2 if and only if (Si, d) G R. 

Next, we look at extrema aggregates, possibly defined on infinite sets. 
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R || ((Si,S 2 ),d) G uZt(R) 1 iff | ((Si,S 2 ),d) G ult(R) 2 iff 



min d G Si A min(S 2 ,<2) d 6 S 2 A -.3d' G Si : d' < d 

MAX d G Si A MAx(S 2 , d) d G S 2 A -.3d' G Si : d! > d 

GLB glb(Si, d) A Glb(S 2 , d) glb(Si U (S 2 n [d, T]),d) 

lub lub (Si, d) A lub(S 2 , d) lub(Si U (S 2 n [_L,d]),d) 

Table 2. Characterization of ultimate approximating aggregates of extrema aggre- 
gates. 



Proposition 7.12 

The characterizations in Tableware correct. 

Next, look at the aggregate functions CARD, SUM and prod defined on finite sets. 

Proposition 7.13 (ult(cAKD)) 

For every three- valued set (Si, S 2 ) and element d: 

((S u S 2 ),d) GwZi(CARD) 1 if and only if |5 X | =d = \S 2 \ 
((Si,S 2 ),d) G tdi(CARD) 2 if and only if \Si\ < d < \S 2 \ 

Proposition 7.14 (uU(svm) and ult (prod)) 

For every three- valued set (Si, S 2 ) and element d: 

((Si,S 2 ),d) G ^(sum) 1 iff SUm(Si, d) A V(a;i, . . . , x n ) £ S 2 \ Si : x 1 = 
((Si, S 2 ),d) G ult(s\JM) 2 iff 3S' C S 2 \ Si : sum(S x U S', d) 

((Si,S 2 ),d) G u^(prod) 1 iff PROD(Si,d)A 

(d = 0W(ii,...,!t n ) G S 2 \Si: .ti = 1) 
((Si, S 2 ), d) G u/<(prod) 2 iff 3S' C S 2 \ Si : prod(Si U S', d). 

Proof 

The proof is straightforward and uses only basic properties of numbers. □ 
Note that the definition of m^(sum) 2 and m^(prod) 2 is simply a reformulation 



of the original definition of ult. In fact, Pelov (20041 shows that the complexity 
of computing ult(svM) 2 and w^(prod) 2 is NP-complete so it is unlikely that any 
efficient algorithms can be found. 

Finally, we look at combined aggregate relations of the form F> and F< where 
F: pp(Di) — > D 2 is an aggregate function on finite sets and < is a total order on 
D 2 . For all three aggregate functions card, SUM, and prod we can give efficient 
algorithms for ult(F>) 1 and idt(F>) 2 . We start with the following general result. Let 
min F ,max F : pp(Di) c — > pp(Di) be functions which for a given finite three-valued 
set (Si, S 2 ) return a set S G [Si, S 2 ] such that f(S) is minimal (resp. maximal) over 
the set [Si, S 2 ], i.e., 

VS' G [Si,S 2 ]: F(min F (Si,S 2 )) < f(S') 
VS' G [Si,S 2 ]: F(max F (Si,S 2 )) > f(S'). 
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Note that for a given aggregate function F and a three- valued set (Si,S 2 ) there 
may be more than one set in the interval [Si, 62] with a minimal value of F. For 
example min SUM (0, {0}) can return either or {0}. 

The values of it/i(F<) and ult(F>) can be computed using the min F and max F 
functions in the following way. 

Proposition 7.15 

Let F: pp(Di) — > D 2 be an aggregate function and < be a total order on D 2 . Then: 

((Si,S 2 ),d) G ultiFy) 1 if and only if F(min F (5i, 5 2 )) > d 
((5i, 5 2 ), d) G ult(F>) 2 if and only if F(max F (5i, 5 2 )) > d 

and similarly for ult(F<): 

((Si,S 2 ),d) G ultiFK) 1 if and only if F(max F (5i, S 2 )) < d 
((Si,S 2 ),d) eult(F<) 2 if and only if F(min F (S*i,5 2 )) < d. 

Proof 

First note that, since {f(5) | 5 G [Si, S 2 ]} is a finite totally ordered set, it always 
has a minimal and a maximal element. We give the proof for ult(F>) . 

((5i,5 2 ),d) Gu/^f^) 1 
^VS G [5i,5 2 ]: (5,d) G F> 
^VS G [51,52]: f(S) > d 
■^Nx G {f(S) I S* G [5i, S 2 }}: x>d 
<^F(min F (S , i, 5*2)) > d. 

The proofs of the other cases are analogous. □ 

So, to decide the first and second components of ult(F>)((Si, S 2 ), d) we need to 
compute the values min F (5i, S 2 ) and max F (5i, S 2 ). First, we show how to compute 
these two functions for any monotone or anti-monotone aggregate function. 

Proposition 7.16 

If F is a monotone aggregate function with respect to < then min F (51,52) = 5i 
and max F (5i, 5a) = S 2 - If F is an anti-monotone aggregate function with respect 
to < then min F (Si, S 2 ) = S 2 and max F (5i, 5a) = Si. 

This proposition can be applied to all aggregate functions listed in Tabled 
For aggregate functions which are non-monotone the idea is to partition the under 
and over-estimate of the input three-valued set to subsets on which the aggregate 
function is monotone or anti-monotone. Then we combine the sets on which the 
function is monotone to obtain min F and the sets on which it is anti-monotone to 
obtain max F . We illustrate this idea for the SUM and PROD aggregate functions. 

Below, S + denotes the set {(xi, . . . ,x n ) G 5 | X\ > 0} and S~ denotes the set 
{(xi, ...,x„) G 5 I xi < 0}. 
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Proposition 7.17 

For every three- valued set (Si, S2): 

min SUM (Si, S 2 ) = S+ U S^ 
max SUM (Si, S 2 ) = Sf U S^ . 

Proof 

Clearly the set S' £ [Si , S2] with minimal sum is obtained by taking Si and all 
tuples with negative numbers from S2 \ Si , that is 

min SUM (Si, S 2 ) = Si U (S 2 \ Si)" = S+ U Sf U (S 2 ~ \ Sf) = S+ U Sj. 

□ 

The aggregate function PROD is non-monotone for sets with arbitrary real num- 
bers as first argument but is monotone for sets with the first argument in the interval 
[1, 00) and anti-monotone for sets with the first argument in the interval [0, 1]. So, 
for PROD on non-negative real numbers we can give a similar algorithm as for SUM 
in Proposition 17. 171 Below, S^ 1 ' 00 ) denotes the set {(aii, . . . ,x n ) G S | x\ G [l,oo)} 
and Si - 1 ) denotes the set {(x\, . . . , x n ) G S | x\ G [0, 1)}. 

Proposition 7.18 

For the aggregate function prod r+ : p F (R + ) — > R+, 

mm pROD»+ (^l' ^2) = S[ 1,00) U S 2 0,1) 
maX PROD»+ S l) = U S^ 1 ' 00 '. 

The algorithms for PROD on the entire set of real numbers are more complicated 
and can be found in IjPelov 2004(1 . 

As an application of the w^-well-founded semantics we consider a recursive for- 
mulation of the shortest path problem. 

Example 7.4 (Shortest Path) 

Consider the following formulation of the problem of finding the shortest path 
l|Van Gelder 19921 Example 4.1): 

sp(x, y, w) «- min({c I cp(x, y, c)}, w). 

cp{x,y,c) <- edge(x,y,c). 

cp(x, y, ci + c 2 ) <- sp(x, z, ci) A edge{z, y, c 2 ). 

The only difference between this program and the formulation of the shortest path 
in Example 15.41 is that we have replaced the cp predicate in the body of the second 
clause of cp with the sp predicate. We have incorporated the knowledge that any 
shortest path of length n + 1 must be an extension of a shortest path of length n. 
This fact is the basis of Dijkstra's algorithm. However, the program is no longer 
stratified because the sp/3 predicate depends on itself through the min aggregate 
relation which is non-monotone. □ 

It turns out the above program is only correct under certain conditions on the 
graph. 
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Proposition 7.19 

Let edge 1 ' be a graph with the property that for any pair of nodes a and b, if there 
is a path from a to b, there is a shortest path from a to b. In the w^-well-founded 
model of the shortest path program from Example 17 . 41 an atom sp(a, b, w) is: 

• true if a shortest path between a and b exists and has weight u>; 

• false otherwise. 

The proof of the proposition and a deeper analysis of this program is given in 
|Appendix A| There are several types of graphs for which we can show that when 
there is a path from one node to another, there is a shortest path between these 
nodes: acyclic finite graphs, finite graphs with non-negative weights and infinite 
graphs with weights in No- It follows from Proposition 17. 191 that for these types of 
graphs, the programs in Example l5.4l and Example l7.4l are equivalent. 

There are also types of graphs which do not satisfy the condition. Connected 
nodes without shortest path can occur if there is a cycle with a negative weight 
between the two nodes. It can also occur in infinite graphs. An example of such 
a graph is {(0,n + 2, 1), (n + 2, 1, 1/n + 2) | n e N}; although there are an infi- 
nite number of paths from to 1, there is no shortest path between and 1 (see 



Ross and Sagiv (1997 ) for another example). In the appendix, we show that in such 
graphs, the well-founded model may be three- valued or may even contain erroneous 
true sp(a, 6, w) atoms, i.e., there are paths between a and b with strictly less weight 
than w. 

We conclude the section on ultimate approximating aggregates by showing that 
for definite aggregate programs, the tt/i-well-founded and u/t-stable semantics are 
equal to the least fixpoint of the Tp 9 ^ which we defined in Section I5"T1 The key to 
the proof of this result is the following lemma. 

Lemma 1.2 

Let V — ultiV) and let (Ii,h) be a three-valued interpretation. If (p is a closed 
positive aggregate formula then Hf,( IuI ^(<p) = {Hv(i 1 )(<p),'Hv(i2)( < P))- If (f is a 
closed negative aggregate formula then Tigg^)^) = {Hv(i 3 )(<p),'Hv(h)(<p))- 



Proof 

The proof extends that of Lemma l7.1l with several new cases when ip is an aggregate 
atom r({x I ip}, t), ip contains defined predicates and R v is either a monotone or an 
anti-monotone aggregate relation. We consider only the case when B? is a monotone 
aggregate relation and ip is a positive aggregate formula. The other three cases (r d 
monotone and ip negative and R/ D anti-monotone and ip positive or negative) are 
symmetric. First, note that since ip(x) is a positive formula then for every tuple of 
domain elements d, ip{d) is also positive. So, 

H (h , h) (iP(d)) - (HM~d)),n l2 (iP(d))) 

and consequently 

m I Tp}h (Il , l2) - ([{a? I iftWo, It* I <P}}v(i 2) ). 
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We have: 

W6(i 1 ,/ a )(*({3MV'},t)) 

(by Proposition 17. lull 

□ 

Theorem 7.3 

Let P be a definite aggregate program. Then P has a two-valued wii-well-founded 
model (M, M) which is also the single uZt-stable model. Moreover M — lfp(Tp^ r ). 

Proof 

From Lemma 1*7.21 follows that if P is a positive aggregate program then 

* a P %{h,h) = {T a p % r {h).T a p % r {I 2 )). 

By Theorem 12.31 follows that ^p 9 f^ t is also the ultimate approximation of Tp 9 j^ . So, 
the w/i-well-founded model of P is equal to the ultimate well-founded model of P 
extending V and, by Corollary 16. II to the least fixpoint of Tp 9 ^ . □ 

7.3 Bound Approximating Aggregate 

The ultimate approximating aggregates have the disadvantage that they may have 
a higher complexity than the aggregate relations which they approximate. We al- 
ready mentioned that the complexity of ult(s\JM) 2 and m^(prod) 2 is NP-complete 
IjPelov 2004(1 while the complexity of SUM and PROD is polynomial. In this section 
we define a less precise approximating operator for aggregate functions that are 
defined on totally ordered finite sets of numbers. 

Definition 7.9 

Let F: pp(Di) — > D 2 be an aggregate function and (D 2 , <) be a totally ordered set. 
The bound approximating aggregate bnd{v): pp(Di) c x D 2 — » TTCR.EE is defined 
as follows 

((S 1 ,S 2 ),d) e bnd{Ff if and only if F(nun F (Si, S 2 )) = d = F(max P (5i, S 2 )) 
({S u S 2 ),d) S bnd(F) 2 if and only if F(min F (S'i ) S 2 )) <d< F(max F (5i, S 2 )). 

Note that by using Proposition 17.151 the definition of bnd(v) is equivalent to 
ult{¥>) A ult{¥<) where A is the greatest lower bound in TH1ZSS with respect to 
the < order (see Example l2.1[l . 

Proposition 7.20 

Let F: pp(Di) —> D 2 be an aggregate function and (D 2 , <) be a totally ordered 
set. Then bnd(F) is a three- valued aggregate relation of F. 

It is interesting to see how bnd(F) compares to uU(f). We first show that the first 
components of the two three-valued aggregate relations are always equal. 



= w/t(R)([{S|^}k (Jl , l2) ,M) 

= ult(K)((l{x I Mfofr), I{2? I V>}W 2 ))> W) 

= (r(1{x 1 V/Wo. [*]),*([{* I V-}]p ( / 2 ), [*])) 

- (H v{h) (R({x I <p},t)),H v{l2) (R({x I i/j},t))). 



Well-founded and Stable Semantics of Logic Programs with Aggregates 41 



Proposition 7.21 

Let F: pp(Di) — > D 2 be an aggregate function and < a total order on D 2 . Then 
bnd(F) 1 = ult(F) 1 . 

However, for most aggregate functions, bnd(F) can be less precise than ult(F), 
i.e., bnd(j) < p ult(F) because bnd(F) 2 > ult(F) 2 . 

Example 7.5 

The pair ((0, {1, 3}), 2) is not in the relation ult(s\JM) 2 because there is no set 
S G [0, {1, 3}] such that SVM(S) = 2. On the other hand ((0, {1, 3}), 2) 6 bnd(s\JM) 2 
because 

bi = SUM(min SUM (0, {1, 3})) = SUm(0) = 0, 

b 2 = SUM(max SUM (0, {1, 3})) = SUm({1, 3}) = 4, 

and bi < 2 < b 2 . So, ult(svM) 2 C bnd(svM) 2 and consequently bnd^SVM) < p 
uU(svm). □ 



7-4 On the Complexity 

In this section we prove a simple complexity result. A full analysis of the complexity 
of model generation for aggregate programs is beyond the scope of this paper but 
we show that, despite the second order nature of aggregates, model generation for 
aggregate programs may remain tractable under an appropriate choice of the three 
valued aggregates. 

The type of computational problem considered here is the model extension prob- 
lem l|Mitchell and Ternovska 20(~)5J> : given a signature E(II), an aggregate program 
P based on E(II) and an input E-structure T> which is two- valued on all predi- 
cates and three-valued on aggregate symbols, compute the Kripke-Kleene model, 
the well-founded model or an exact stable model. Informally, we are interested here 
in the complexity for instances of these problems with fixed P, fixed E(IT), and 
"fixed" interpretation of the aggregate symbols in Aggr(H) and for varying but 
finite input £ \ Ag<?r(I])-structures T> . The problem with this intuition is that the 
interpretation R 15 of a given aggregate symbol R in this class is of course not really 
fixed: it varies with the input structure T> . We are interested in classes of problems 
where for example the sum predicate is systematically interpreted by its ultimate 
approximating aggregate, but evidently, the sum aggregate relation and its ultimate 
approximating aggregate depend on the domain of the input structure V a . 

To circumvent this problem, we introduce the following concepts. Let us fix an 
aggregate program P based on signature S(il). Consider the class C of two- valued 
structures of S \ Aggr{Yi) with a finite domain (i.e., with finite domains for ev- 
ery sort s). We assume a given function X from C to the class of E-structures 
such that for each T> £ C, X(T> ) extends V with three- valued aggregates 1Z for 
each aggregate symbol R £ Aggr (£). Moreover, we assume that for each aggregate 
symbol R £ Aggr(Y<), there are two Turing machines which, for (an appropriate 
encoding of) arbitrary T> £ C and arbitrary well-typed pair (5 1 , d) consisting of a 
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three-valued set S and a domain element d, compute respectively (r x ^ t ' 0>> ) 1 (S, d) 
and (R x ^) 2 (S,d). 

Let X(C) be the image class of C under the function X. The main result of 
this section is that if each three-valued aggregate 1Z € Aggr(E) is polynomially 
computable in the size of the domain of the input structure, i.e., if for arbitrary T> , 
S and d, the Turing machines associated to 1Z can compute (r x ^ t>0 ') 1 (S, d) and 
(r x ^ t ' ^) 2 (S, d) in polynomial time in the size of the domain of T> (i.e., the total 
number of elements in all domains s " of all sorts s), then the following holds: 

Theorem 7.4 

• deciding if an atom A is true in the Kripke-Kleene model of a program P 
extending a structure in the class X(C) is in P; 

• deciding if an atom A is true in the well-founded model of a program P 
extending a structure in the class X(C) is in P; 

• deciding the existence of an exact stable model of a program P extending a 
structure in the class X(C) is in NP. 

Note that, in the common case, computing the value of an aggregate is polynomial 
in the size of the input three- valued set S (of a fixed type s\ x • • • x s n ). Then, since 
the number of elements in such a set is bounded by a polynomial in the size of the 
domain of T> , computing the value of the aggregate is certainly polynomial in the 
size of the domain of the input structure T> . 

Proof 

Let L be a (finite) lattice and A an approximation operator on L c . Suppose that 
n is the length of the longest chain in L. The computation of the Kripke-Kleene 
and well-founded fixpoint of A and the test whether a lattice element is an exact 
stable fixpoint of A is done by monotone fixpoint computations. It is easy to see 
that the number of applications of A to compute its Kripke-Kleene fixpoint is 
bound by n. Also testing whether a lattice element is an exact stable fixpoint of A 
requires at most n applications of A. Because the computation of the well-founded 
fixpoint involves an embedded fixpoint computation, its computation takes at most 
n 2 applications of A. 

Let us now consider a model extension problem for fixed P, £(II) and aggregate 
extension function X. Given an input E \ Aggr(E)-structure T> , the lattice in 
which the computations take place is the power-set lattice 2 = p(base-n o (II)). The 
maximal chain length in this lattice is the number of facts, i.e., the cardinality of 
&asex> o (II). This number is polynomial in the size of the domain of T> . It follows 
then that all we need to prove to obtain the desired complexity results is that for 
any given pair J G I, we can compute &°^x(v )(^) m polynomial time in the size 
of the domain. 

From Definition 17.51 it follows that &px(p corresponds to computing the 
truth value of the bodies of all rule instances A <— <p e inst(P). It is clear that the 
number of rule instances is polynomial in the size of the domain of T> . Therefore, all 
we need to prove is that for an arbitrary aggregate formula ip[x] with free variables 
x, the truth value of tp\x/d] in J, for arbitrary T> £ C and tuple d of domain 
elements, can be computed in polynomial time in the domain size of T> . 
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In case tp is a first order formula, the polynomial computability of its truth value 
with respect to a three- valued structure is proven by induction on the structure of 
tp. We need to extend this proof with the additional case that ip is an aggregate 
atom. Computing the truth value of an aggregate atom r(s, d) requires firstly, 
to evaluate the contained set expression s and compute its three-valued set S, 
and secondly, to evaluate the truth value of (Tl x( - v ^) l (S,d) or (K X ( V °)) 2 (S, d). 
It follows straightforwardly from the induction hypothesis that the first task can 
be done in polynomial time in the domain size of T> , whereas the second task is 
polynomial by assumption. □ 

Our main motivation for developing the semantic framework of this section was 
the high complexity of the ultimate well-founded and stable semantics as defined 
in Section [H| This result shows that under appropriate choice of the three- valued 
aggregates, we indeed obtain weaker but tractable Kripke-Kleene and well-founded 
semantics. 

The above theorem is subject to a strong limitation, in particular the restriction 
to finite structures. Many of the interesting applications of aggregates (including 
the company control and the shortest path problem) contain integer or real num- 
bered domains. Frequently used aggregates such as SUM and CARD range over these 
infinite domains. Clearly, in the context of infinite domains, only strong syntac- 
tical conditions on the form of rules can guarantee termination or tractability of 
the model generation process. But this is the case whether the program contains 
aggregate expressions or not. To discover such conditions is an important topic for 
future research but it is beyond the scope of this paper to investigate this issue. 

7. 5 Summary of the results 

In this and the previous section, we have introduced a family of Kripke-Kleene, 
a family of stable and a family of well-founded semantics for aggregate programs, 
parameterized by the approximation operator. We introduced also a sub-family of 
these semantics, obtained by extending the standard three-valued Fitting operator 
with different three- valued aggregates. We presented two generic ways for deriving 
a three-valued aggregate from a given (two-valued) aggregate relation, called the 
trivial and the ultimate three-valued aggregate. 

All instances in each of the three families are consistent with each other. In partic- 
ular, when one instance of the Kripke-Kleene or well-founded semantics infers that 
a literal is true, there is no instance in which this literal is false and any more pre- 
cise instance of the same semantics will infer the same literal. Also, a model in one 
instance of the stable semantics is also a model in each more precise instance of the 
stable semantics. This shows that all instances of each of the three families of seman- 
tics basically formalize the same intuitions but with different degree of precision, 
with the semantics based on the ultimate operator as the most precise. However, 
for several important subclasses of aggregate programs, optimal precision is reached 
using weaker approximations. As a general rule, when the well-founded model of one 
approximation is two-valued, then it coincides with the well-founded and unique 
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stable model of each more precise approximation. In Theorem 17.31 and Corollary 
16.11 we proved that for definite aggregate programs, the least fixpoint semantics, 
the w^-well-founded, ult-st&ble, the ultimate well-founded and the ultimate sta- 
ble semantics all coincide. One example is the Company Control Example 15. 21 For 
stratified aggregate programs, Theorem 17.21 showed that the standard model, the 
tnu-well-founded and triv-stable model and the ultimate well-founded and stable 
model all coincide. This class was illustrated by the first Shortest Path Example 
17.41 In case of the second Shortest Path Example l7.4l Proposition l7.19l showed that 
(under some weak conditions) the liZi-well-founded model is two-valued. 

This raises the question of what a good choice of the semantics is and how we can 
exploit our results to build an effective reasoning system for aggregate programs. 
Just as for standard logic programming, the family of semantics of aggregate pro- 
grams offers a trade-off between precision versus complexity. Therefore, choosing the 
"right" semantics is a pragmatic matter and the relevant question is what degree 
of precision is required for the applications that one has in mind. 

In the case of logic programs without aggregates, the standard and the ultimate 
semantics only differ for programs with a rare combination of reasoning by cases and 
recursion, as in {p <— p V We are not aware of a single non-artificial example 
of a logic program, appearing in the literature, which shows this behaviour. This 
means that, de facto, the standard and the ultimate semantics coincide and the 
ultimate semantics can be computed using the more efficient Fitting operator. In 
case of aggregate programs, the situation seems even better. It must be admitted 
that applications of recursion over aggregates in the literature are quite rare. We 
believe that this is not a coincidence. Indeed, aggregates provide a rather powerful 
way to avoid recursion. In a language without aggregates, the means to compute 
the value of some aggregate of a set of objects (e.g., cardinality, sum, minimum 
or maximum, . . . ) is often by using recursion over the potential elements of the 
set l|Van Gelder 1992(l . By allowing aggregates in the language, such applications 
of recursion can be avoided. Recall that for stratified aggregates programs even 
the iriu-stable and irif-well-founded semantics are sufficiently precise. As for the 
applications of recursion over aggregates in this paper, the Party Invitation program 
of Example 15. II is to be interpreted under the supported model semantics, whereas 
for the second Shortest Path Example 17 . 41 and the Company Control Example 15. 21 
the uZt-well-founded model is the correct solution. 

In summary, we believe that extensions of the standard semantics with polyno- 
mially computable three-valued aggregates, provide sufficient precision for almost 
all practical applications. This is the case for the ultimate approximating aggre- 
gates of card, min, MAX and the bound approximating aggregates of SUM and 
PROD l|Pelov 20 04). To us, it seems that the most suitable approximation for being 
incorporated in model generation or query systems for aggregate programs, is the 
extension of the Fitting operator with those three-valued aggregates. 
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8 Related Work 

Aggregate relations are closely related to generalized quantifiers (Lindstrom 1966). 
An example of a generalized quantifier is MOSt(A,B), defined on page El which 
expresses that most elements of set A belong to set B. Clearly, this relation can 
be viewed as a binary aggregate relation with two set arguments. Standardly, the 
notion of generalized quantifier is formalized in a slightly more involved way than 
as a second order predicate in an arbitrary domain. Instead the concept is formal- 
ized as a class of structures closed under isomorphism. For example, MOST could 
be formalized as the class of all structures consisting of a domain D and a binary 
relation M C p(D) x p(D) consisting of all pairs (A, B) such that A is finite and 
more than half of the elements of A belong to B. In this way, a domain indepen- 
dent characterization of the generalized quantifier is obtained. Aggregates could 
be formalized similarly. For example, the cardinality aggregate could be formal- 
ized as the class of all structures consisting of a domain D and a binary relation 
C C p(D) x N containing all tuples (£, n) such that S is a subset of D containing n 
elements. An extensive study of generalized quantifiers in three- valued logic is done 



by van Eijck (1996). The notion of a super-valuation interpretation of a generalized 
quantifier, as defined there, coincides with our notion of ultimate approximating 
aggregate of the corresponding aggregate relation. 

In the context of logic programming, many different approaches to aggregates 
have been proposed. Below, we discuss a selection of them. 

The class of monotonic aggregate programs (Mu mick et al. 1990(1 is very similar 
to the class of definite aggregate programs. A monotonic program is a program in 
which every rule is monotonic. A monotonic rule is a rule r such that / \= body(r) 
and / C J implies J |= body(r) for any pair of interpretations / and J. Although this 
is a semantic definition of monotonicity, the authors introduce a sufficient syntactic 
condition for monotonicity of a rule. Essentially, an aggregate atom can appear only 
in formulas of the form 

3zr({x I q(x)}, z) A p(z, t) (1) 

where p is a pre-defined relation. Moreover, the satisfiability of this formula must 
be monotone. In our syntax Q) can be expressed as the aggregate atom 

Rp({x\q(x)},t) (2) 

using the derived aggregate relation Rp. Then, the condition that the satisfiability 
of Q is monotone is equivalent to the condition that Rp is a monotone aggregate 
relation (and consequently J2J) is a positive aggregate atom) . The notion of positive 
aggregate atoms is simpler and, in our opinion, more natural than the condition of 
monotonic literals of IjMumick et al. 1990|l . 

A common approach to extend the stable semantics of general logic programs 
with negation to aggregate programs was to treat aggregate literals as negative lit- 
erals when computing the program reduct QKemp and Stuckey 1991| IGelfond 20021 
Elka bani et al. 2004J) . Such semantics is relatively easy to define and the defini- 
tion also extends to answer sets of disjunctive logic programs. However, it has been 
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shown that this semantics accepts non-minimal models and does not model correctly 
some problems with recursion over aggregation, for example the Company Controls 
program from Example IS"2l IjPelov 20041 Section 4.3.3). For aggregate programs 
containing only negative aggregate literals it has been shown IjPelov et al. 2004|) 
that the set of tt/i-stables models is the same as the set of stable models defined by 
the above authors. 

A more elaborate definition of a stable semantics was given by ((Simons et al. 2002|l 
for programs with weight constraints and implemented by the well-known smodels 
system. In our language, weight constraints correspond to aggregate atoms build 
with the SUM< and SUM> aggregate relations. An extensive comparison between 
the ult-stable semantics and the stable semantics of weight constraints can be found 
in l|Pelov et al. 20041 IPelov 2004|) and will not be repeated here. 

A novel feature of the language of weight constraints was that it allows weight con- 
straints to be present also in the head of the rules. This approach have been further 
developed in different directions. One line of research was to consider different varia- 
tions and extensions of weight constraints like abstract constraints | |Marek and Truszczyhski 2004| ), 
monotone cardinality atoms IjMarek et al. 200l| or set constraints QMarek and Remmel 2004jl . 
Such constraint atoms correspond in a natural way to aggregate atoms. The stable 
semantics of these extensions is also defined in terms of lattice operators. How- 
ever, since constraint atoms are allowed in the heads of rules, the operators become 
non-deterministic and the algebraic theory is quite different than the approximation 
theory we used in this work. However, all the semantics agree on the class of definite 
aggregate programs and its least model semantics. The equivalent of a definite logic 
program in IjMarek and Remmel 2004|) is called a Horn SC-logic programs and such 
programs are also characterized by a unique model which is the least fixpoint of a 
deterministic monotone operator Sp which is the equivalent of our Tp 9 ^ r operator. 

Another extension of the language of weight constraint atoms is to considers 
arbitrary propositional formulas containing arbitrary aggregate atoms (both in the 
head and in the body of the rule) ((Ferraris 2005(1 . The answer set semantics for such 
propositional formulas is different than the 25-stable semantics which we defined. 
The main reason is that the semantics of l|Ferraris 2005(1 is based on the logic of 
here-and-there (Lifschi tz et al. 20 011 while the Testable semantics are based on 
Kleene's strong three- valued logic TT-CTLEE. The simplest example of the difference 
between the two logics is that -i-ip is equivalent to p in TTilZSS while it is not in the 
logic of here-and-there. As a consequence, the program consisting of the single rule 
p <— -i-ip is a definite program according to Definition ^. fil because -i->p is a positive 
(aggregate) formula. So, all our semantics assign to this program the model On 
the other hand, it has two models and {p} according to the answer set semantics 
of ( Lifschi tz et al. 20011 IFerraris 2005jl . This difference also manifests for aggregate 
formulas. For example, it is easy to see that the aggregate formulas -iSUM^(s, t) and 
SUM(s,i) have the same three- valued truth value for any interpretation of SUM with 
a three- valued aggregate relation while they are not equivalent under the semantics 
of l(Ferraris 2005( 1. 

Another proposal for a stable semantics of disjunctive logic programs extended 
with aggregates was given in ((Faber et al. 2004|l . In the sequel we investigate in 
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more detail the relationship with this semantics to the family of 2?-stable se- 
mantics defined earlier. First, we recall the definitions of the stable semantics of 
IjFaber et al. 20040 . 

Definition 8.1 \Faber et al. 20041 

The reduct P 1 of an aggregate program P with respect to an interpretation I is 
a program obtained from inst(P) by deleting all rules in which a literal or an 
aggregate atom in the body is false in I. An interpretation / is a FLP-stable model 
of P if I is a minimal model of P 1 . 

In one direction we can show the following result. 

Proposition 8.1 

For any aggregate program P and for any three-valued structure T>, if an interpre- 
tation I is an ©-stable model of P then / is a FLP-stable model of P. 

Proof 

We will show that any Instable model / is also a X>-stable model of P 1 . It will then 
follow from Lemma |2.1I that / is a minimal pre-fixpoint of Tp 99 ^. Since the pre- 
fixpoints of Tp 99 ^ are exactly the models of P 1 , this will imply that / is a minimal 
model of P 1 and hence, / is a FLP-stable model of P. We start by showing 

VJ C I: {^f{J,I) = {* a °£)\j,I). (*) 

This will imply that rfpCC^ 9 /^) 1 > -0) = lfp((*p^') 1 (->-0) = I, so I will be a 
stable model of P 1 . To show (0 we only need to look at the rules r € P — P 1 and 
show that H^., (body(rj) — f. The definition of reduct implies that for such a 

rule Ti.j)^j^(body(r)) = f where T> is the (two-valued) structure approximated by T>. 
We also have that (J, I) < p (I, I) which implies that 

Hl w) {body{r)) < H^ {I I) (body(r)) = H v(I) {body(r)) = f 

So,H± (JI) (body(r))=f. □ 

The next example gives a program for which the two semantics disagree. 

Example 8.1 

Consider the following aggregate program P: 

r <— CARD^({x | p(x)}, 1). 
p(A) <- r. 
p(B) <- r. 
p(A)<-p(B). 
p(B)<-p{A). 

The program has only one model M = {r, p(A),p(B)}. This is also a FLP-stable 
model because P M = P and M is also a minimal model of P. However, the program 
does not have an ultimate total stable model and, consequently, it does not have a 
total stable model for any less precise approximating operator. 
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To illuminate what is going on in this example, let us make the following ob- 
servation. In the context of the Herbrand universe {A, B}, the aggregate atom 
CARD^({x | p{x)}, 1) expresses that p has either zero or two elements. Or, in each 
two- valued Herbrand interpretation this atom is equivalent to 

(^p(A) A -<p(B)) V (p(A) A p{B)). (3) 

In fact, if we interpret CARD^ by the ultimate approximating aggregate, then a 
simple case analysis shows that, in each three- valued Herbrand interpretation, the 
truth values of the aggregate atom and of the formula © are identical 3 . It follows 
that the program P and the program obtained by substituting the rules 

r <- ^p{A) A ^p(B). 
r ^p(A) Ap(B). 

for the first rule in P, have identical three- valued immediate consequence operators 
and hence, have identical stable models 4 . The second program is a standard logic 
program and it is easy to see that it has no stable models. In particular, if wc 
compute the reduct under M = {r, p(A),p(B)}, only the second new rule remains 
and together with the rest of the rules the least model is 0. Thus M is not a stable 
model of the translated program. □ 

The above example illustrates a natural principle of the semantics defined in our 
framework. Substituting an aggregate atom by an aggregate free formula which is 
equivalent with respect to two-valued semantics, preserves ultimate well-founded 
and ultimate stable models. Substituting an aggregate atom by an aggregate free 
formula which is equivalent with respect to three-valued semantics preserves the 
standard well-founded and stable semantics. As shown by the example, this natural 
principle is not satisfied by the semantics defined in IfFaber et al. 2 004). This is a 
fundamental weakness of this semantics. 

Finally, we mention another recent work on defining a stable semantics of ag- 
gregates l|Son and Pontelli"2 007). As shown in that paper, the semantics defined 
there coincides with the uZt-stable semantics and provides an interesting different 
formalization for this semantics. 

9 Conclusions and Future Work 

In this paper we presented a systematic extension of all major semantics of logic pro- 
grams to aggregate programs: the least fixpoint semantics (van Emd en and Kowalski 1976|) . 
the standard model of stratified programs ( |Apt et al. 1988| ), the supported model 
semantics QApt et al. 1988| , the Kripke-Kleene semantics ( |Fitting 1985| ), the stable 
model semantics iGelfond and Lifschitz 1 988). the well-founded semantics l|Van Gelder et al. 1991fl . 

3 The three- valued equivalence of it/t(CARD^) and J2J is an app lication of a general transformation 
of ultimate approximations of aggregate atoms to formulas (Pelov 2004 Section 5.3.6). 

4 Stronger, both three-valued and two-valued immediate consequence operators coincide, and 
hence, all semantics of the two programs based on these operators coincide, including ult- 
Kripke-Kleene, uit-stable and uit-wcll-founded semantics. 
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and the ultimate stable and ultimate well-founded semantics (IDenecker et al. 2 004 ) . 
The extension of the stable and well-founded semantics is not unique but is pa- 
rameterized by how aggregate relations are extended to three-valued relations on 
three-valued sets. We studied three instances of these semantics. Two of them are 
the least precise (called triv) and the most precise (called ult) in this family and 
they are defined in a uniform way for all aggregate relations. The third instance 
(called bnd) is defined only for aggregate functions on totally ordered sets. For some 
aggregates, most notably SUM and PROD the 6n<i-seniantics is strictly less precise 
than the ult semantics. Although, we did not present here a full complexity analy- 
sis, the advantage of the 6n<i-semantics over the uZt-semantics is that it has a lower 
complexity (Pclov 2004). 

We also showed that all important properties and relationships of the original se- 
mantics are preserved in the extension. For example, a stratified aggregate program 
P has a two-valued 2?-well-founded model which is equal to the unique 25-stablc 
model of P and the standard model of P for any three-valued interpretation T> of 
the aggregate relations. A similar result for a definite aggregate program P is that 
the M^-well-founded model is equal to the unique ult-stable model and the least 
fixpoint model of P. Another important property of all stable semantics which we 
define is that stable models are always minimal models. 
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Appendix A Proof of the Shortest Path Theorem 

Before proving Proposition 17.191 we first introduce some concepts. 

We define the length of a path as its number of edges. We define a partial function 
sp(., .) as follows: sp(a,b) is defined if and only if there is a shortest path from a 
to b and sp(a, b) = w where w is the weight of the shortest path. Note that it is 
possible that there is a path from a to b while there is not a shortest path. E.g., 
consider the graph with edges {(a, a, —1), (a, b, 1)}. It has no shortest paths. 

Proposition \7.19\ 

Let edge be a graph with the property that for any pair of nodes a and b, if there 
is a path from a to b, there is a shortest path from a to b. In the itZt-well-founded 
model of the shortest path program from Example 17.41 an atom sp(a,b,w) is: 

• true if a shortest path between a and b exists and has weight w; 

• false otherwise. 

Proof 

We will compute the u/i-well-founded model of P by an alternating fixpoint com- 
putation using the sequences (I n )n<EN and (J n )n<EN which are defined by mutual 
induction: 

• Iq = J-; 

• J n = Sti p (I n )=\fp(<S> 2 P (I n ,-)); 

• I n = Stip(Jn-l) = lf P( $ p(-» Jn-l))- 

We now construct this sequence until we reach a fixpoint. 

1. We show that J = lfp($^(_L, ■)) = C U S where 

C = {cp(a, b, w) | there is a path from a to b with weight w} 
S = {sp(a, b, w) | there is a path from a to 6 with weight w}. 

In the first iteration $%(±, A.) = Tp 9 % r (L) = C x where 

C\ = {cp(a,b,w) | edge(a,b,w) £ edge }. 
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In the next iteration, the value of cp/3 will not change because the value of sp/3 
has not changed. On the other hand cp/3 has changed, so sp/3 will change now. 
First, let us compute the value of the set expression for values a and b: 

{{c\ cp(a,b,c)}} { ±, Cl ) = (®,W ab ) 

where W a _b = {w | (a, b, w) E edge®}, i.e., there is a path between a and b of length 
1. According to TableEJin Section [7^1 it holds that m^(min) 2 ((0, W a b), x) is true 
for all x G W a b- So, we obtain $p(_L, Ci) = C x U Si where 

Si = {sp(a,b,w) | edge(a,b,w) £ edge }. 

Continuing this process we can show by induction that for every positive integer 
number n > we have 

*f,(-L,-)T(2n-l)= \J QU \J St 

l<z<n l<i<n 

$?,(±,.)T(2n)= |J |J S. ( 

l<2<n l<z<n 

where 

Ci = {cp(a, 6, u>) | there is a path from a to 6 of length i and weight u>} 
Si = {sp(a, b, w) | there is a path from a to 6 of length i and weight w}. 
At the first limit ordinal u> we have 

$f,(_L, •) t u = (J Ci U |J 5, = C U 5. 

Hence, after this first step, we have computed in Jo all possible path weights between 
any two pairs of points a and b. 
2. Next we show that h = lfp($^(-, C U S)) = CE U SP where: 

SP = {sp(a, b, w) | there is a shortest path from a to 6 of weight w} 

CE = {cp(a, b, w) | (a, 6, € edge v or 

3c, uii : sp(a, c, wi) 6 SP and (c, 6, to — Wi) £ edge 15 } 

Define also the following sets for every i > 1: 

SPi = {sp(a, 6, w) | there is a shortest path from a to 6 of length i and weight in } 

CEi = {cp(a,b,w) | (a,b,w) S edge® or 

i > 1 and 3c, wi : sp(a, c, u?i) £ SPi-i and (c, 6, k; — w{) G edge®} 

Note that SP = U lGN and CE = U eN 
For the first iteration we verify that 

$},(1,CU5) =Ci = CEi. 

To see why this is the case, we compute the value of the set expression: 

\{c | cp(a,b,c)}}( ±iC us) = {%,A ab ) 

where 

A a b = {w | there is a path from a to & of length w}. 
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Further, u/i(MiN) 1 ((0, A a b), w) is false for every weight w (see Table |3] of Sec- 
tion 1^3). So, the interpretation of sp/3 will be the empty set. The interpretation 
of cp/3 will be the same as the edge relation. Hence we obtain the set CE\. 
On the next iteration only the value of sp/3 will change. We have 

[{c | cp(a,b,c)}\ C E u CuS) = (W ab ,A ab ) 

and {{W a b, A a i,), w) G ult(Min) 1 if w £ W ab and the shortest path between a and 
b has weight u>. So 

§ P {CE 1 , C U 5) = CEi U SP X . 

On the next iteration, the cp/3 relation becomes the composition of SP\ with the 
edge relation: 

^(CEi U SP U C U 5) = OEx U CP 2 U SPi. 

In the next step we compute CEi U CP2 U SPi U SP2, i.e., we obtain all shortest 
paths of at most length 2. Continuing this process we obtain a fixpoint which is 

<$> P {CE U SP, C U 5) = CE U SP. 

At this stage, we have found all shortest paths. 

3. Next, we show that J l = ]£p($ p (CE U SP, •)) = CEUSP, i.e., <&p(CE U 5P, CE U 
SP) = CEUSP. 

It is straightforward to verify that cp(a, 6, w) <E $p(C£ U SP, CE U SP) if and only 
if cp(a, b, w) 6 CP. As for sp{a, b, w), we first have to consider the value of the set 
expression 

{{c | cp(a, b, c)}l {CEuS p iCEvS p) = (B ab ,B ab ) 

where 

B a b = W a b U {w | 3c, w\ : sp(a, c,w\) G SP A (c, 6, to — wi) G edge }. 

Either there is no path from a to b, in which case B a b is empty, and its minimum 
undefined. In this case, no sp{a, b, w) atom is derived. Or, there is a path from a 
to b. Then by the assumption of the proposition, there is a shortest path from a to 
b, say with weight w. This minimal path is either an edge from a to b or it is an 
extension of a shortest path from a to some vertex c. In both cases, w belongs to 
the set B a b and is its least element. In this case, sp(a,b,w) is derived. Hence 

<t> P {CE U SP, CE U SP) = CE U SP. 

4. Since J\ — I\, it follows that 1% = Ji = I\ and that we have reached a fixpoint 
which is the two- valued u/i-well-founded model that was to be proven. 

□ 

The following example shows that the condition in the proposition is essential 
for the proof. 



Example A.l 
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Consider the following graph {(a, a, — 1), (a, b, 0)}. It is easy to see that there are 
no shortest paths because of the cycle in a. Hence, SP = 0. 

We compute the w^-well-foundcd model of P by an alternating fixpoint compu- 
tation. The first three steps are exactly as in the proof. Things only change in the 
computation of J\. For this step, the proof exploited the fact that all connected 
pairs have a shortest path, but this assumption does not hold anymore. We have: 

• 7 = ±. 

• Jo — {sp( a , a , — n— 1), sp(a, b, —n),cp(a, a, — n— 1), cp(a, b, —n) \ n 6 N}. This 
describes all paths in the graph. 

• Ji = CE U SP = {cp(a, a, —l),cp(a, b, 0)}. Indeed, since there are no shortest 
paths in this graph, the set SP is empty and CE is just a copy of the edge 
relation. 

• J i — lfp (Qp(CE, •)). The computation is entirely similar to the fixpoint com- 
putation of Jo (see the proof of the proposition). Define 

Cj = {cp(a, a, — n — 1), cp(a, b, —n) \ n € [0, i]} 

Si = {sp(a, a, -n),sp(a, b, -n + 1) n 6 [0, 

Note that CE = C U S a . First, we compute § 2 P {CE,CE). Since there are 
no true sp atoms, the computed cp atoms correspond to the edges. Hence, 
cp remains unchanged. As for sp, its rule derives the atoms sp(a,a, —1) and 
sp(a,b, 0), i.e., we obtain Co U Si. In the next iteration, since cp did not 
change, sp remains unaltered. Now cp is extended by composing Si with the 
edge relation. We obtain C\. In the next iteration, cp remain identical, and 
now sp will be extended to obtain S*2. In general we have the same fixpoint 
computation as for J except that we start at CE rather than at _L. It holds 
that 

& P (CE, •) T (2n) - C n U S n 
® 2 P (CE,-) t (2n + 1) = C n U S n+ i 

The limit of this sequence, Ji, is equal to Jo- Therefore, I 2 will be equal to Ii, 
so we reached the well-founded fixpoint which is (ii, Jo). Since 1\ 7^ Jo, this 
is a three-valued model. For example, for each negative integer n, the atom 
sp(a, b, n) is unknown. □ 



